ScrapeIt Search

A self-built search engine. Add sites to start crawling.

Add a Website

Enter a URL to seed the crawler. ScrapeIt scrapes the page's metadata and images and automatically discovers linked pages.

If the protocol is missing, https:// is added automatically.
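
As a rough illustration of the protocol rule above, here is a minimal Python sketch. The function name normalize_url and the extra steps (lowercasing the host, dropping the fragment, defaulting an empty path to "/") are assumptions made for the example, not ScrapeIt's documented behavior.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(raw: str) -> str:
    """Return a canonical form of a user-submitted URL (illustrative only)."""
    url = raw.strip()
    # Add the default protocol when the user typed a bare host or path.
    if "://" not in url:
        url = "https://" + url
    parts = urlsplit(url)
    # Assumed extras: case-insensitive host, no fragment, non-empty path.
    netloc = parts.netloc.lower()
    path = parts.path or "/"
    return urlunsplit((parts.scheme.lower(), netloc, path, parts.query, ""))
```

With this sketch, normalize_url("example.com") returns https://example.com/, and a URL that already carries a protocol keeps it.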

How crawling works:
ScrapeIt fetches the page HTML, extracts title, description, keywords, Open Graph tags, favicon, all images (with alt text), and all links. It then queues discovered links for crawling. Duplicate pages (same URL after normalization) are skipped. The crawler runs continuously in the background.
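
The extraction and queueing steps described above could be sketched roughly as follows, using only the Python standard library. PageParser and enqueue_new_links are hypothetical names, fetching is left out, and normalization is reduced to stripping the fragment, so this is an illustration of the idea and not ScrapeIt's actual code.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class PageParser(HTMLParser):
    """Collects title, meta tags, favicon, images, and links from one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.title = ""
        self.meta = {}      # description, keywords, og:* tags, keyed by name
        self.favicon = None
        self.images = []    # (absolute URL, alt text) pairs
        self.links = []     # absolute URLs
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            key = a.get("name") or a.get("property")
            if key and a.get("content") is not None:
                self.meta[key.lower()] = a["content"]
        elif tag == "link":
            rel = (a.get("rel") or "").lower().split()
            if "icon" in rel and a.get("href"):
                self.favicon = urljoin(self.base_url, a["href"])
        elif tag == "img" and a.get("src"):
            self.images.append((urljoin(self.base_url, a["src"]), a.get("alt") or ""))
        elif tag == "a" and a.get("href"):
            self.links.append(urljoin(self.base_url, a["href"]))

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def enqueue_new_links(links, queue, seen):
    """Queue each link once; returns how many links were newly added."""
    added = 0
    for link in links:
        key = urldefrag(link).url  # simplified normalization: drop "#fragment"
        if key not in seen:
            seen.add(key)
            queue.append(key)
            added += 1
    return added
```

A crawler loop would pop a URL from the queue (for example a collections.deque), fetch it, feed the HTML to PageParser, and pass parser.links to enqueue_new_links with a shared seen set.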

Search Engine Stats

Indexed Sites
Indexed Images
Unique Domains
Links Found
Queue Size
Failed Sites
Recently Crawled

No sites crawled yet

All Sites

Domain Title Status Images Links Last Crawled

Browse Domains

Random Page

Bookmarks

No bookmarks yet. Click ☆ Save on any search result.

Search History

No history yet.

Terms of Service

Last updated: March 2026

What we collect

SearchIt collects only two categories of data:

  • Usage data: how often search is used, which pages are visited, and which terms are searched. Your IP address is never stored. It is immediately hashed (SHA-256) and only the first 16 characters of that hash are kept, making it impossible to trace back to you.
  • Site data: the URLs, titles, descriptions, and metadata of websites you submit to be crawled. This is the search index itself.
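
For readers curious what the IP hashing step might look like, here is a minimal Python sketch. It assumes "characters" means characters of the hexadecimal digest; the function name anonymize_ip and the optional salt parameter are hypothetical, and the text above does not say whether a salt is used.

```python
import hashlib


def anonymize_ip(ip: str, salt: str = "") -> str:
    """Hash an IP address with SHA-256 and keep the first 16 hex characters.

    `salt` is a hypothetical extra that is empty by default, which matches
    the plain SHA-256 described in the text.
    """
    digest = hashlib.sha256((salt + ip).encode("utf-8")).hexdigest()
    return digest[:16]
```

The same address always yields the same 16-character value, which is what lets visits be counted without the raw address being kept.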

What we do NOT collect

  • Your name, email, or any personal information
  • Your raw IP address
  • Cookies or tracking identifiers
  • Any data from websites you visit through search results

Why we collect usage data

Usage statistics (total searches, page views, popular queries) are used solely to understand how the search engine is being used and to improve it. This data is only visible to the site administrator.

Data retention

Raw event data is automatically deleted after 30 days. Aggregated counts (daily totals, top queries) are kept indefinitely but contain no personal information.

Third parties

We do not sell, share, or send any data to third parties. Ever.

Contact

If you have any questions, open an issue at github.com/Simonko-912/SearchIt.