An SEO data engine that’s always on

Similar.ai collects and analyzes all of the data you need to understand your search performance, your competitors’, and which changes will have the most impact on your site.

Similar.ai automates the research you need to move the needle

Keyword data

Search volume
Topics
Keywords
Keyword ranking
Misspellings
Competitive keywords

Page & site data

Listings
Stock levels
Categories
Meta-data
Text data
Semantic data
Search engine result pages

Knowledge graph data

Topic labels
Topic cannibalisation
Same topic keywords
Total demand for a topic
Misspellings & synonyms

Matched pages

Traffic & impressions
Demand per page
Keywords for which a page ranks
Keywords for which a page could rank
Page uniqueness increased by content
Unbranded traffic opportunity

Page level actions

Pages that should be kept
New pages that should be added
Page redirects
No indexed pages
Canonical tags
No search engine users

Intelligent APIs to optimize the structure of your site

Link to relevant pages that can deliver incremental unbranded traffic with our internal linking API.

Hide superfluous pages and redirect to valuable ones with our cleanup API.

Add better headings, titles, descriptions, and FAQs to your site with our content API.

See a macro-view of how your site maps to real customer demand

Similar.ai uses natural language processing (NLP) to auto-generate fresh content that best showcases your products and listings.

01

Similar.ai’s machine learning models cluster keywords to show you topics, or groups of keywords that users think mean the same thing.

02

Our cloud crawler indexes your site like Google does, showing which pages rank and which could receive incremental unbranded, organic traffic.

03

Your unique knowledge graph maps our findings with your current pages and identifies gaps that should be filled.

The result: a roadmap to SEO optimization at scale

RVshare chose Similar.ai’s always-on SEO Research Engine to enrich our understanding of millions of keywords, topics, search intents and entities, both those for which we rank and those which we could, to optimize our pages & content, because it was the only product-led SEO platform that could handle the scale at which we work.

Martijn Scheijbeler
VP Marketing, RVshare

FAQs

What is a topic?

A topic is a set of keywords for which users would expect to see the same results.

What is a canonical keyword?

The canonical keyword is the keyword that is most representative of the topic as a whole.

What is total demand?

The total demand is the total number of searches done for that topic in a local market. It combines all the searches for which users expect to see similar results.

Could I just add up the search volume of keywords to get total demand?

It’s not quite that simple. Often keyword tools give the same search volume for misspelled keywords as for correctly spelled keywords and give the same search volume for re-orderings of the same keywords. Our platform corrects for that to give you the most accurate way to prioritize your pages around the potential to deliver unbranded organic traffic.

A lot of our Google Search Console keywords are misspelled. Can you help us filter these out?

Yes, our SEO research engine identifies misspellings and what the correctly spelled term should be.

Couldn’t we just Search Console UI instead?

Our research engine finds all of your pages, both those that rank and those that don’t. We then identify the keywords for which you rank, and the keywords for which you could, and the total demand of topics matched to your pages. Search Console UI only tells you the keywords for which you rank today, not the keywords for which you could rank. It only gives you 1,000 results. It doesn't tell you the traffic opportunity for your pages. It also doesn't let you easily handle large segments of 100,000s or millions of pages.

Couldn't we just Search Console API instead?

Large sites using the search console API typically miss 2/3 of their data. Check how to (remove GSC API limits)[https://similar.ai/blog/closing-google-search-console-sampling-gap/]. There are widespread misunderstandings about how the API works. If you're unsure, talk to us! We'll check how much data you're missing and explain how to fix the GSC API limits, like we do for our other customers.

Our competitive keyword universe covers millions of keywords. Can you handle that?

Yes, with ease. Similar.ai works with product-led SEO teams who need to deliver product features to scale SEO across the whole of their site. Our SEO research engine covers pages segments of 100,000s or millions of pages.

Does your research engine stay up-to-date?

Yes, the research engine is always on. We pull in the latest keywords for which you rank daily, we check Googlebot crawls daily, and update relevant competitive topics either monthly or every second month. We typically update page data, such as the number of listings per page or the listing relevance, either daily or a few times a week.

We have some internal tooling we use for reporting and paid search. Is it possible to integrate the SEO data engine into BigQuery and our data pipelines?

Sure, a number of customers are ingesting our full dataset into their existing data pipelines today. Let’s figure what data you need together: book a time to [talk to us].

We use different suppliers to cover ranking keywords, competitive keywords, topic clustering, crawling and knowledge graphs. Do you mean that you do all of these things, connect them together and make it easier for us to leverage them to make changes to our site?

Yes.

Why build a single cohesive grid of SERP, intent, domain, ML models, knowledge graph data, page data and listing data?

Our mission is to enable more of your users to experience more value faster, by fixing derailed customer journeys, and making sure every page delivers the best experience tailored to the needs lots of users have. Through all the models we’ve built, tracked daily with millions of data points and millions of ranking results we are able to group all the keywords and intent into the topics which will deliver your site organic revenue, we deliver a higher quality of user to your site. We help you control the landing page experience by improving internal navigation and classification detection of how relevant the results are to the Google query. This helps you convert traffic at a higher rate. Better quality traffic with a better conversion creates a flywheel for search engine users.

See how it all comes together on a personalized demo of Similar.ai