Bitcoin Forum
June 25, 2025, 06:16:54 PM *
Poll
Question: Should I create translations of the website?
Yes - 18 (69.2%)
No - 8 (30.8%)
Total Voters: 26

Author Topic: Talksearch.io - Advanced Bitcointalk Search Engine  (Read 2795 times)
NotATether (OP)
Legendary
*
Offline Offline

Activity: 2002
Merit: 8642


Search? Try talksearch.io


June 12, 2025, 08:21:27 AM
Last edit: June 13, 2025, 04:13:35 AM by NotATether
Merited by cygan (3), klarki (2)
 #121

New content update v1.0.3 and backend update v1.0.2 published

These updates add advanced search capability to Talksearch.

Search features:

  • Searching by user ID, topic, board, and date is now possible.

App features:

  • Added dark mode theme.
  • Added advanced search section to the front page.
  • Advanced search parameters appear in the query string with colons. For example: "bitcointalk author_uid:35".
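
To illustrate the colon syntax, here is a minimal sketch of how such a query string could be split into free text and filters. It is not Talksearch's actual parser: `parse_query` and every filter name other than `author_uid` are assumptions made up for this example.

```python
# Hypothetical filter names; only author_uid appears in the post above.
KNOWN_FILTERS = {"author_uid", "topic", "board", "date_from", "date_to"}


def parse_query(query: str):
    """Split a query into free-text terms and key:value filters."""
    terms, filters = [], {}
    for token in query.split():
        key, sep, value = token.partition(":")
        # Only treat the token as a filter when the key is recognised,
        # so ordinary text containing a colon (e.g. a URL) stays a term.
        if sep and value and key in KNOWN_FILTERS:
            filters[key] = value
        else:
            terms.append(token)
    return " ".join(terms), filters


text, filters = parse_query("bitcointalk author_uid:35")
print(text, filters)
```

Checking keys against a known set is one possible design choice; it keeps tokens such as URLs from being mistaken for filters.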

48 million posts have now been indexed. Indexing is nearly complete.



Have you ever thought about a post/topic author rating system?

A higher-ranked user (more posts, merit, rank) passes the filters, while the rest go through tighter filters. This may reduce the number of posts analyzed and improve filtering.

I don't like such a system because it would bias search results toward users with a lot of merit.

cygan
Legendary
*
Online Online

Activity: 3570
Merit: 10441


Top-tier crypto casino and sportsbook


June 12, 2025, 08:56:01 AM
 #122

New content update v1.0.2 published
✂️

very nice to see another update of your search engine :)
to keep the translated threads (by taufik123, satscraper, Abdulzuruku01, katanic97, Adiljutt156, mela65, r_victory, GazetaBitcoin, Danica22 and Porfirii) up to date, i would ask you to update your op and the changelog - but you were probably planning to do that anyway ;)

taufik123
Legendary
*
artcontest
Online Online

Activity: 2926
Merit: 2017


Rollbit.com | #1 Solana Casino


June 12, 2025, 01:18:45 PM
 #123

New content update v1.0.2 published
Shouldn't this be a v1.0.3 update, because there was already a v1.0.2 update?

New app update v1.0.2 published

 
nutildah
Legendary
*
Offline Offline

Activity: 3388
Merit: 9578



June 13, 2025, 01:11:39 AM
 #124

I think they get the overall sentiment, especially the last one, but it would be unwise to rely only on an LLM as a universal quality score.

Agreed. What is interesting or relevant to an LLM might not be so for the people actually using your search engine.

Additional measures must be put in place to identify e.g. application posts, obviously AI-generated posts, and the like, so that they are not returned in search results.

I like the initiative you're taking here. What's interesting is that, last I checked, Google doesn't filter out AI-generated content, but it may do so in the future if it turns out that nobody wants to read such content, which would make their search results less accurate or relevant to the query than they could be. Seems like it would be super easy to game SEO ranking with AI content, so I don't know why they wouldn't attempt to block it.

NotATether (OP)
Legendary
*
Offline Offline

Activity: 2002
Merit: 8642


Search? Try talksearch.io


June 13, 2025, 04:13:01 AM
Last edit: June 13, 2025, 01:10:30 PM by NotATether
 #125

New content update v1.0.2 published
Shouldn't this be a v1.0.3 update, because there was already a v1.0.2 update?

New app update v1.0.2 published

You're right, but only the frontend would be v1.0.3, because there was no v1.0.2 update for the backend.

I like the initiative you're taking here. What's interesting is that, last I checked, Google doesn't filter out AI-generated content, but it may do so in the future if it turns out that nobody wants to read such content, which would make their search results less accurate or relevant to the query than they could be. Seems like it would be super easy to game SEO ranking with AI content, so I don't know why they wouldn't attempt to block it.

Fortunately, my preliminary tests detect AI-generated posts with about the same accuracy as other types of spam.



New content update v1.0.4 published

This is a minor update that adds missing time controls for the Date From and Date To filters.
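
For readers curious how a Date From / Date To filter with a time component might translate into a search request, here is a rough sketch that builds an Elasticsearch query body using a `range` clause inside a `bool` filter. The field names `date` and `content` and the helper names are assumptions for illustration; the real index mapping is not described in this thread.

```python
from datetime import datetime


def date_range_filter(date_from=None, date_to=None, field="date"):
    """Build a range clause; `field` is a hypothetical field name."""
    bounds = {}
    if date_from is not None:
        bounds["gte"] = date_from.isoformat()
    if date_to is not None:
        bounds["lte"] = date_to.isoformat()
    return {"range": {field: bounds}} if bounds else None


def build_query(text, date_from=None, date_to=None):
    """Combine a full-text match with an optional date filter."""
    # "content" is an assumed name for the field holding the post body.
    query = {"bool": {"must": [{"match": {"content": text}}]}}
    date_clause = date_range_filter(date_from, date_to)
    if date_clause:
        query["bool"]["filter"] = [date_clause]
    return {"query": query}


body = build_query("bitcoin", datetime(2025, 3, 1), datetime(2025, 6, 13, 4, 13))
print(body)
```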

All posts up to March 2025 have now been indexed. I am actively working on enabling real-time indexing.

NotATether (OP)
Legendary
*
Offline Offline

Activity: 2002
Merit: 8642


Search? Try talksearch.io


June 14, 2025, 05:36:18 AM
Last edit: June 14, 2025, 03:17:28 PM by NotATether
 #126

All posts from March to June 2025 are now being uploaded to the index, while I continue to work on an automated solution to this problem.

Edit: this batch was uploaded with wrong dates, which would cause search errors, so it has been deleted and is being re-uploaded.

Edit 2: All done.

I want to implement a spam score as soon as possible, but I'm still not exactly sure how I will do that without re-indexing all the posts. At any rate, I will figure something out.
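
One possible direction, sketched here purely as an assumption and not as what Talksearch does: Elasticsearch's bulk API accepts `update` actions that carry only the changed fields, so a score could be attached without re-sending the full post bodies. (Elasticsearch still rewrites each document internally on a partial update.) The index name `posts`, the field name `spam_score`, and the helper name are hypothetical.

```python
import json


def bulk_update_lines(index, scores):
    """Build an NDJSON bulk body of partial updates.

    `scores` maps a document ID to its spam score. Each document takes two
    lines: the action metadata, then the partial document to merge in.
    """
    lines = []
    for post_id, score in scores.items():
        lines.append(json.dumps({"update": {"_index": index, "_id": str(post_id)}}))
        lines.append(json.dumps({"doc": {"spam_score": score}}))
    # The bulk API expects the body to end with a newline.
    return "\n".join(lines) + "\n" if lines else ""


print(bulk_update_lines("posts", {101: 0.12, 102: 0.97}), end="")
```

The resulting string would be sent to the `_bulk` endpoint with the `application/x-ndjson` content type.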

Rashlyowl
Jr. Member
*
Offline Offline

Activity: 42
Merit: 67

rākā - ₿ - vṛṣabha


June 16, 2025, 04:11:48 AM
 #127

Hey bros @NotATether, is it possible to implement pagination/paging directly on the site?



When I've gone too far, I want to go back to the page I want, but opening previous pages is a barrier for me. The solution is actually easy, just by changing:

Current page
Code:
https://talksearch.io/search?q=Bitcointalk&page=9

To

Page I want to see
Code:
https://talksearch.io/search?q=Bitcointalk&page=4

It's a bit annoying to do by hand, though; proper pagination would improve the user experience.
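
For anyone scripting the manual workaround described above, a small sketch using Python's standard `urllib.parse` is shown here. The helper name `with_page` is made up for this example; only the `q` and `page` parameters come from the URLs in the post.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_page(url: str, page: int) -> str:
    """Return `url` with its `page` query parameter set to `page`."""
    parts = urlsplit(url)
    # Keep every parameter except an existing `page`, then append the new one.
    params = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
              if k != "page"]
    params.append(("page", str(page)))
    return urlunsplit(parts._replace(query=urlencode(params)))


print(with_page("https://talksearch.io/search?q=Bitcointalk&page=9", 4))
```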
NotATether (OP)
Legendary
*
Offline Offline

Activity: 2002
Merit: 8642


Search? Try talksearch.io


June 16, 2025, 06:17:28 AM
Last edit: June 16, 2025, 09:37:08 AM by NotATether
Merited by Rashlyowl (1)
 #128

Hey bros @NotATether, is it possible to implement pagination/paging directly on the site?



When I've gone too far, I want to go back to the page I want, but opening previous pages is a barrier for me. The solution is actually easy, just by changing:

Current page
Code:
https://talksearch.io/search?q=Bitcointalk&page=9

To

Page I want to see
Code:
https://talksearch.io/search?q=Bitcointalk&page=4

It's a bit annoying to do by hand, though; proper pagination would improve the user experience.

As you said, this sort of change is very easy to do, and I will make sure to find some time for it.

Due to a lack of software available on GitHub for this purpose, I'm currently busy building a project for calculating "embeddings" with text classification LLMs. This kind of software is conspicuously missing from open-source repositories, and it is essential for anybody who is building a search application without a paid Elasticsearch subscription. Those subscriptions are expensive, even though they package AI search directly.

The hope is that others in the AI community will find it useful. So I am not indexing new posts for now, as the spam score calculation depends on this. Or maybe I will do that in parallel, as it will benefit everyone. Technically, I can do it from a terminal using Python scripts, but who likes doing it that way?
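
As a rough illustration of what "calculating embeddings" can involve (not the project's actual code), one common approach with BERT-style models is to average the per-token vectors, skipping padding, and then compare the pooled vectors by cosine similarity. The sketch below uses plain Python lists in place of real model output; the function names and toy numbers are assumptions.

```python
import math


def mean_pool(token_vectors, attention_mask):
    """Average the token vectors whose mask entry is 1 (padding is 0)."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("no unmasked tokens to pool")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy stand-in for model output: three tokens, the last one is padding.
embedding = mean_pool([[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]], [1, 1, 0])
print(embedding, cosine(embedding, [2.0, 3.0]))
```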

Edit: This is what it's going to look like: https://bert-embedding-playground.lovable.app/ - it's designed to be self-hosted. This is just the frontend though, I haven't written much of the backend yet. And even the frontend was made by AI, because I suck at designing HTML by hand. :-\



(Modified again to avoid double-posting)

New app update v1.0.5 published

This is a minor update that adds detailed pagination to search results.
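
A pagination bar of this kind usually shows the first and last pages plus a few pages around the current one. The sketch below is a guess at that general technique, not the site's actual implementation; `page_window` and its `radius` parameter are hypothetical.

```python
def page_window(current: int, total: int, radius: int = 2):
    """Return the page numbers to display, with None marking a gap."""
    if total < 1:
        return []
    current = max(1, min(current, total))  # clamp into the valid range
    wanted = {1, total} | set(
        range(max(1, current - radius), min(total, current + radius) + 1)
    )
    pages, previous = [], 0
    for page in sorted(wanted):
        if page - previous > 1:
            pages.append(None)  # render as an ellipsis
        pages.append(page)
        previous = page
    return pages


print(page_window(9, 20))
```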

NotATether (OP)
Legendary
*
Offline Offline

Activity: 2002
Merit: 8642


Search? Try talksearch.io


June 22, 2025, 09:08:19 AM
Last edit: June 23, 2025, 08:25:15 AM by NotATether
 #129

In the near future, after I set up the scraper to run automatically on the index, I will set up an API for retrieving posts, topics and users and enable limited access to it.

The Elasticsearch cluster is healthy at the moment and is running without issues.

Thank you for using Talksearch.



PSA: Date filtering for posts older than September 2020 does not work properly. I will create a new index to fix this.



New content update v1.0.3 published

This is a minor update that implements a workaround for the date filtering bug described above. A full re-index will be done at a later time as the long-term fix.
