I am trying to make a search engine for Bitcointalk posts, since Google and the built-in one are so bad.
List all the features you want in a search engine here.
For now, I am scraping topics from the forum using my bot. The bot identifies its requests as coming from me, so the admins know where this traffic is coming from.
It doesn't look like it's exceeding the threshold of one request per second, so that's good.
Private boards are not being scraped. The scraping is being done as a guest.
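For anyone curious what that setup might look like, here is a rough sketch of a guest-mode fetcher that identifies itself and stays at or under one request per second. This is not the bot's actual code: the USER_AGENT string, the RateLimiter class, and the build_request and fetch helpers are all made-up names for illustration, and it uses only Python's standard library.

```python
import time
import urllib.request

# Hypothetical identification string; a real bot would name its operator here.
USER_AGENT = "BitcointalkSearchBot/0.1 (contact: see forum profile)"
MIN_INTERVAL = 1.0  # seconds between requests (the one-per-second threshold)


class RateLimiter:
    """Makes sure consecutive wait() calls return at least min_interval apart."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self.clock = clock   # injectable so the limiter can be tested without waiting
        self.sleep = sleep
        self.last = None     # time of the previous request, if any

    def wait(self):
        now = self.clock()
        if self.last is not None:
            remaining = self.min_interval - (now - self.last)
            if remaining > 0:
                self.sleep(remaining)
                now = self.clock()
        self.last = now


def build_request(url):
    """Build a guest request (no cookies, no login) that identifies the bot."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def fetch(url, limiter):
    """Wait for the rate limiter, then download the page body as bytes."""
    limiter.wait()
    with urllib.request.urlopen(build_request(url), timeout=30) as resp:
        return resp.read()
```

Sharing one RateLimiter(MIN_INTERVAL) across every fetch call keeps the whole crawl under the limit, no matter how many topics are queued.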
That sounds like an awesome project! A dedicated Bitcointalk search engine would be a huge improvement over Google and the built-in search. Here are some features that would make it really useful.
1. Better Search Filters
Search by User – Find posts by a specific username.
Date Range – Filter results by specific timeframes.
Board/Category Search – Only show results from selected sections like Altcoins, Services, or Mining.
Keyword Relevance – Prioritize results based on keyword frequency and context.
Post Type Filtering – Option to search only for thread titles or include replies.
2. Faster and Smarter Results
Real-time indexing so new posts show up quickly.
Cached results for faster searches without hitting the server hard.
3. Better Post Previews
Show snippets of posts before clicking.
Preserve BBCode formatting so posts don’t look broken.
4. Sorting by Popularity & Engagement
Sort by most replies, most views, or most merit to find valuable discussions faster.
5. Mobile-Friendly & Dark Mode
Clean, simple design that works well on mobile.
Dark mode support for easier reading at night.
6. API for External Use
A public API so devs can integrate search results elsewhere.
7. Save & Bookmark Searches
Ability to save frequent searches and access them later.
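To make the filter and sorting ideas a bit more concrete, here is a rough in-memory sketch. Everything in it is an assumption for illustration: the Post fields, the search function, and the sort_by names are hypothetical, and "relevance" here is just a plain keyword count, not a real ranking algorithm. A real engine would run this against an index or database instead of a Python list.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Post:
    # Hypothetical schema; field names are assumptions for this sketch.
    author: str
    board: str
    posted: date
    title: str
    body: str
    replies: int = 0
    views: int = 0
    merit: int = 0


SORT_KEYS = ("relevance", "replies", "views", "merit")


def search(posts, keyword, user=None, board=None, start=None, end=None,
           titles_only=False, sort_by="relevance"):
    """Filter posts by user, board, and date range, then sort the matches."""
    if sort_by not in SORT_KEYS:
        raise ValueError(f"sort_by must be one of {SORT_KEYS}")
    kw = keyword.lower()
    hits = []
    for p in posts:
        if user is not None and p.author != user:
            continue
        if board is not None and p.board != board:
            continue
        if start is not None and p.posted < start:
            continue
        if end is not None and p.posted > end:
            continue
        text = p.title if titles_only else p.title + " " + p.body
        score = text.lower().count(kw)  # naive keyword-frequency relevance
        if score:
            hits.append((score, p))
    if sort_by == "relevance":
        hits.sort(key=lambda h: h[0], reverse=True)
    else:
        hits.sort(key=lambda h: getattr(h[1], sort_by), reverse=True)
    return [p for _, p in hits]
```

For example, search(posts, "cooling", board="Mining", sort_by="merit") would return only Mining posts that mention "cooling", with the highest-merit ones first.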
It’s great that you're being mindful of forum limits and not scraping private boards. Keeping traffic within reasonable limits should help avoid any admin issues. Definitely excited to see how this evolves. Keep us posted! 🚀