Skip to content

Commit

Permalink
ads;)
Browse files Browse the repository at this point in the history
  • Loading branch information
altonalexander committed Feb 28, 2023
1 parent 82e35a1 commit f753bfd
Show file tree
Hide file tree
Showing 2 changed files with 95 additions and 12 deletions.
75 changes: 65 additions & 10 deletions analysis-clustering-keywords/cluster-keywords-with-valueserp.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,31 @@
"# Cluster bulk list of keywords with valueserp\n",
"\n",
"\n",
"Quickly cluster a large quantity of keywords using SERP results"
"Quickly cluster a large quantity of keywords using SERP results\n",
"\n",
"A summary and conclusion at the bottom including next steps and alternate methods\n",
"\n",
"\n",
"### About Alton\n",
"\n",
"Follow me for more data and tutorials\n",
"\n",
"- twitter: https://twitter.com/alton_lex @alton_lex\n",
"\n",
"- linkedin: https://www.linkedin.com/in/altonalexander/\n",
"\n",
"\n",
"### About Data Winners\n",
"\n",
"Join the conversation:\n",
"\n",
"- private Discord community\n",
"\n",
"- Video tutorials\n",
"\n",
"- Feedback and support on this and other scripts\n",
"\n",
"Join now: https://datawinners.gumroad.com/l/data-analytics-for-seo"
]
},
{
Expand Down Expand Up @@ -2219,13 +2243,6 @@
"topic_groups_dictdf.to_csv(\"./output-clustered-queries.csv\", index=False)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
},
{
"cell_type": "markdown",
"metadata": {},
Expand All @@ -2234,12 +2251,50 @@
"\n",
"In this sample we input over 8000 keywords and clustered them into 1333 clusters\n",
"\n",
"3603 were not clustered and live in their own (cluster size of 1)\n",
"3603 were not clustered and live in their own (cluster size of 1 keyphrase each)\n",
"\n",
"### Next steps:\n",
"\n",
"- SERP clustering is one of many methodologies for clustering keywords\n",
"\n",
"- SERP clustering can be adjusted beyond just URLs that match to include intent, related searches, etc.\n",
"\n",
"- More sophisticated clustering methods include NLP\n",
"\n",
"- Less sophisticated clustering methods include n-grams\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n",
"\n",
"# Questions & support\n",
"\n",
"Feel free to reach out to me on https://twitter.com/alton_lex"
"Feel free to reach out to me on https://twitter.com/alton_lex\n",
"\n",
"\n",
"### About Alton\n",
"\n",
"Follow me for more data and tutorials\n",
"\n",
"- twitter: https://twitter.com/alton_lex @alton_lex\n",
"\n",
"- linkedin: https://www.linkedin.com/in/altonalexander/\n",
"\n",
"\n",
"### About Data Winners\n",
"\n",
"Join the conversation:\n",
"\n",
"- private Discord community\n",
"\n",
"- Video tutorials\n",
"\n",
"- Feedback and support on this and other scripts\n",
"\n",
"Join now: https://datawinners.gumroad.com/l/data-analytics-for-seo"
]
}
],
Expand Down
32 changes: 30 additions & 2 deletions analysis-onpage/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,30 +8,36 @@ All of my scripts are open source.

If you would like to support and/or follow along then please consider joining the step-by-step course here: https://datawinners.gumroad.com/l/data-analytics-for-seo

---

## Scraping Data for Programmatic SEO

Collecting data from a page using beautiful soup

[scraping-tables-for-programmatic-seo.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-onpage/scraping-tables-for-programmatic-seo.ipynb)
[analysis-onpage/scraping-tables-for-programmatic-seo.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-onpage/scraping-tables-for-programmatic-seo.ipynb)


## Entity extraction

Using spacy to extract entities and parts of speach

[analysis-onpage/onpage-entity-extraction.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-onpage/onpage-entity-extraction.ipynb)


## Bulk Analysis - Schema Extraction for a SERP

Extract schema from one url or from an entire serp

[analysis-onpage/onpage-schema-analysis.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-onpage/onpage-schema-analysis.ipynb)



## Bulk Analysis - Internal Link Extraction

Getting the most popular internal links for an entire site

[analysis-internal-links/internal-link-graph.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-internal-links/internal-link-graph.ipynb)




Expand All @@ -41,8 +47,30 @@ Given a search query, scrape the top pages for all the headers (h2,h3,h4, etc).

Return the results as a table of sections:

[SERP-to-sections-jam-session.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-serp-scrape-top-results-for-metadata-summary/SERP-to-sections-jam-session.ipynb)
[analysis-serp-scrape-top-results-for-metadata-summary/SERP-to-sections-jam-session.ipynb](https://github.com/FrontAnalyticsInc/data-winners/blob/main/analysis-serp-scrape-top-results-for-metadata-summary/SERP-to-sections-jam-session.ipynb)


---


### About Alton

Follow me for more data and tutorials

- twitter: https://twitter.com/alton_lex @alton_lex

- linkedin: https://www.linkedin.com/in/altonalexander/


### About Data Winners

Join the conversation:

- private Discord community

- Video tutorials

- Feedback and support on this and other scripts

Join now: https://datawinners.gumroad.com/l/data-analytics-for-seo

0 comments on commit f753bfd

Please sign in to comment.