Bitcoin Forum
July 21, 2024, 04:20:27 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
  Home Help Search Login Register More  
  Show Posts
Pages: [1] 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 ... 551 »
1  Other / Meta / Re: [BETA] NEW ninjastic.space search on: Today at 04:07:22 AM
Do you have any idea how many years it took for search engines to do what you're asking for?
A system of this kind is very complex to build. That's why it remains difficult to destroy Google and its algorithm.

The objective here is obviously not to "destroy" Google, but to make a search system for Bitcointalk that is more useful than Google's site:bitcointalk.org (and the native search).

So yes, I know it's going to be hard, but I don't see it being that complex. For one thing, we have only millions of posts, not billions of pages like google. Second, we won't have as many users, and consequentially people trying to "abuse" the system compared to Google (only a few thousand people, instead of millions of people using the search).

Regarding spam, I don't think anyone's going to shed any tears over spam posts not being indexed. For instance, I plan to (and am currently in the process of) index the entire Politics and Society. However, you can eliminate 95% of the spam by enforcing a minimum character count in posts like in sig campaigns, as well as using a regex to filter out sig/bounty application posts. Google's bias accusations are literally because they tweak the ranking of certain kinds of pages that they are offended by - I have no such plans.

Now yes, I know that if you're trying to conjuncture your own search software, you will run into the limitations you just described.

However, we are very fortunate to have open-source search software written by people who do have years of experience in the field, for example Apache Lucene (which coincidentally I have experience with but in a completely different industry). Because of this, sites can roll up their own search, in case Google's results are insufficient. Example: Hacker News.

The above is not written to discourage anybody from creating their own search algorithms, it just shows the requirements that need to be in place to make an "intelligent" search. There's nothing wrong with the current implementation if you're just using it to search for someone's (or your own) post, and I think that is what TryNinja is trying to make here.

I can see much value there will be for an intelligent search engine if somebody manages to pull it off, especially with other repositories like bitcoin-dev (the old version) in dire need of indexing as well.
2  Bitcoin / Project Development / Re: Bitcointalk Search Project on: July 20, 2024, 05:56:48 PM
There is no way for you to track which post I may edit.  You would need to rescan all my 22k posts on a regular basis, or use some AI to determine what I may have edited.  The number of seconds in a day is limited to about 4x my number of posts, and I'm sure there are more than three other people posting.

That's what I'm saying. I will have to manually go through all the user's posts (but hey, at least I'll have their message and topic IDs this time!) but at least I know that statistically, I will only need to search a few users at a time, because editing is infrequent compared to posting.

It's not perfect obviously - edited posts probably won't be indexed for several hours like that* - but it's the best I can think of.

* When the initial download is finished
3  Other / Meta / Re: [BETA] NEW ninjastic.space search on: July 20, 2024, 05:30:32 PM
And what do you expect it to find when you search "I like trains"? Or any actual example, for that matter.

This is going to sound controversial but I think that meaningless posts should not be indexed, or at least not returned. The algorithm should be prioritizing information over exact matches.

Open DuckDuckGo or any other search engine and search for it. After the videos, images etc you will start seeing results like this:



The mission of a search engine is to help people find what they are looking for. This is almost always stuff like answers, facts, and other kinds information with a semantic load. There is a lot of spam on Bitcointalk that nobody wants to read on the main site. So as much as possible, search engines must avoid returning spammy posts as results because this will discourage people from using them, as they will not find the kind of information they are looking for.

PS. even merit cannot be used to bias the posts because author merit does not apply if the post in question has no semantic load, and post merit allows for merit farms to manipulate search results by meriting spammy posts.
4  Bitcoin / Project Development / Re: Bitcointalk Search Project on: July 20, 2024, 05:03:19 PM
We have seen a lot of threads with the word "Reseved" to get edited in the future. So, here we have another challenge for the project. How to deal with edited posts?

This is going to be a live search engine, so every post is going to be kept up to date, and removed if the original post is removed as well.

Edit: I will implement a heuristic that tracks when the last time a person logged into their account, and correlates that to the frequency of posts they made on particular days versus now, Banned users and inactive accounts that haven't posted for 120 days (or whatever the "This account recently woke up from a long period of inactivity" threshold is) can be excluded, so this leaves a small subset of users who will might actually make an edit, out of the number daily active users, whose user IDs should then be prioritized to scan for edits and deletions.

The number of new posts daily >>> The number of edited posts daily

*I do not currently track the "last edited" time because it is an unreliable indicator for determining whether a given post might be edited in the future.
5  Other / Meta / Re: [BETA] NEW ninjastic.space search on: July 20, 2024, 05:01:02 PM


IMO, I think this is going to be a problem.

Exact results are pretty much useless as nobody is going to be searching with entire post contents.

Semantic meaning is not taken into context either, just the lexical word, so we are not only getting results for trains here but about a ton of "training" posts from Gambling Discussion.

I'm sure we would love to have google-like search results but we need to figure out the secret sauce first.
6  Bitcoin / Bitcoin Discussion / Re: Craig Wright is missing, over charge to face probe for alleged perjury on: July 19, 2024, 05:11:40 PM
CSW decided to leave the UK to avoid getting caught in a CRIMINAL charge

*the common sense theory is the judge sought out to check if his passport was used at an airport without csw consulting the courts, rather than CSW consulting the court with a forwarding address of residency due to wanting to shy away from public eye

He's eventually going to be served, unless he wants to find himself on an INTERPOL Red Notice.

Does even INTERPOL have to tell him that he is not Satoshi? SMH.
7  Bitcoin / Bitcoin Discussion / Banks, Airlines, and Hospitals are all down today on: July 19, 2024, 05:09:37 PM
...but not Bitcoin!

https://techcrunch.com/2024/07/19/what-we-know-about-crowdstrikes-update-fail-thats-causing-global-outages-and-travel-chaos/

Quote
A faulty software update issued by security giant CrowdStrike has resulted in a massive overnight outage that’s affected Windows computers around the world, disrupting businesses, airports, train stations, banks, broadcasters and the healthcare sector.

CrowdStrike said the outage was not caused by a cyberattack, but was the result of a “defect” in a software update for its flagship security product, Falcon Sensor. The defect caused any Windows computers that Falcon is installed on to crash without fully loading.

“The issue has been identified, isolated and a fix has been deployed,” said CrowdStrike in a statement on Friday. Some businesses and organizations are beginning to recover, but many expect the outages to drag on into the weekend or next week given the complexity of the fix. CrowdStrike CEO George Kurtz told NBC News that it may take “some time for some systems that just automatically won’t recover.” In a later tweet, Kurtz apologized for the disruption.

Bitcoin never takes a break. Its architecture prevents someone from faulting millions of nodes worldwide. It also resists bugs like the one CrowdStrike just made on Windows. And even if some Bitcoin developer were to accidentally introduce a bug into Core, nobody is obligated to run the latest version of it so it will not even bring down their node.

Free PR for crypto!
8  Bitcoin / Project Development / Re: Bitcointalk Search Project on: July 19, 2024, 12:12:29 PM
In case you're (both) wondering: I'm still working on it. Creating a tar on a spinning disk with 50 million files was a bit of a mistake. It's fast to write but takes days to read.

Is that a compressed tarball? Because for my daily backups I usually tar my folders without compression to make it go many times faster.



I'm considering putting together a website and releasing the search, but with incomplete data for now. So far, I have over 23k posts (up to roughly July 2011), but I would not like to wait months before people can actually use this.

It would also give me time to figure out the best setup for this.
9  Economy / Reputation / Re: Wasabi topic hijacked by crooked Kruw on: July 19, 2024, 12:02:46 PM
He wrote some text that appears in the UI. That does not require coding knowledge.

Doesn't surprise me.

(Docs don't require coding knowledge either besides Markdown or ReST.)
10  Bitcoin / Bitcoin Discussion / Re: Is Trump About to Make Bitcoin the US’s Secret Weapon? on: July 19, 2024, 05:39:51 AM
Even though Trump announce it July 27, he will not going to make it happen in the land of US because he want to use bitcoin and cryptocurrency to play politics with bitcoin and cryptocurrency users in the country, some thing like this happened in Nigeria during the campaign that brought Nigeria current president into power which he promised cryptocurrency and bitcoin users that he will lift the banned on cryptocurrency and bitcoin users to have good access to do anything concerning bitcoin and cryptocurrency, but after he won the election and resume office,  he changed his mind to believed the negative things people where saying about bitcoin and cryptocurrency, which is what Trump will going to do if he succeed in this election.

AKA lies, more lies, and statistics.

Both presidential candidates are going to make as many promises as they can in order to secure the White House, and of course they are going to break a lot of them - that's what politicians do. And it's not only applicable in the US. Politicians in all countries regularaly lie in order to get an advantage. It's literally part of the job requirements if you want the job.
11  Bitcoin / Project Development / Re: Bitcointalk Search Project on: July 18, 2024, 05:51:20 PM
If you keep track of every new topic, all you need to do is add +1 to the ID and see if it exists. Some problems might arrive, like the thread being deleted before you check, meaning that the last ID has changed, but there are ways of minimizing this.

I could try checking the next sequence of 100 posts or so, in order to check for new posts, since it's extremely unlikely that they were all deleted.
12  Economy / Service Discussion / Re: 📝[Discussion topic] Overview of Bitcointalk Signature-Ad Campaigns📝 on: July 18, 2024, 05:45:54 PM
Besides, Bitcointalk should not be the sole earning source, it is a side hustle and people should be serious about it.

I'm available on Upwork if you have some prospects you want to offer me.  Wink
13  Economy / Service Discussion / Re: Scams Bitcoin Mixers List and Services closed on: July 18, 2024, 05:43:59 PM
Another thing that can be done to fill the database:
If the legit service is 1234.com, you can consider all others TLDs not safe to use, and add them directly to the DB. It can save some time in the long run Grin

Now I need to figure out how to get a list of all the registered TLDs.
14  Other / Meta / Re: TalkImg.com - Image hosting for BitcoinTalk on: July 18, 2024, 01:00:06 PM
Does your uploader support WebP images, by the way?

I might have known before, but since I have not attempted to upload any WebP to TalkImg, I'm not so sure anymore.

It's a very useful format, and allows me to copy and paste images directly from places like Reddit into other sites without having to save them first.

There was another site I was using that was having problem with WebP image uploading, so that's why I asked.

(Everyone should be supporting this format in 2024, anyway.)
15  Economy / Service Announcements / Re: [ANN] BitMixList.org - Bitcoin Mixer List on: July 18, 2024, 12:54:39 PM
@JeromeTrash that is good to know.

I still have to work on getting the RoyalMix verification working for the letter of guarantee verification. The infra is there, it just needs a public key to be inserted. So one of these days I will check that and will have it working.

And also I would like to add a quick link at the top right corner that takes you to the letter of guarantee verification tool immediately. I think that would be of great utility.
16  Economy / Economics / Re: US-China economy war + radical Trump = higher inflation + interest rates + war on: July 18, 2024, 12:50:14 PM
Another way they've always used to control inflation is to bring down energy prices (remember Petrodollar).
In order to do that they'll have to increase the supply of oil in the global market.
To do that they need to (1) remove sanctions on Russia and let that oil flow again (2) convince Arab dictators to increase their production (3) end the wars in West Asia so that security comes back and they can actually increase production (remember Saudi invasion of Yemen is in a ceasefire state and not ended).

There is a good chance that the next US president will be able to fulfill #1 by ending the NATO-Russian war (possibly recognizing occupied parts as Russia) as there are enough signals coming out from US making me confident about that possibility. Trump will probably have an easier time doing that because of his radical decisions.

The problem will be #2 and #3.
~

I don't think that Ukraine has any intention to end the war any time soon unless they get their occupied providences back (they will also push for Crimea but there is a tiny chance that that part could be negotiated over).

EU too, are not willing to let a ceasefire happen until that is done, because the cold, calculating Putin would like to invade other european countries next after he is done with Ukraine, so they are unlikely to cooperate.
17  Bitcoin / Development & Technical Discussion / Re: Are there any threats when a user translates exposed mnemonic words? on: July 18, 2024, 12:26:45 PM
That is a good point that should be emphasized more. BIP39's proposal actually strongly discourages using any language other than English simply because most wallets don't bother with other languages; BIP39 does have 10 wordlists but you'll probably have people asking why not more.

BIP39 also has no version number which makes it impossible to update the standard, which was another reason behind the motivation to create the Electrum wordlist.

Efforts to add new languages to the BIP39 official wordlist repository have also ground to a halt because nobody is merging those changes into the BIP repository, and in many cases, comments and communication have simply ceased.
18  Bitcoin / Bitcoin Discussion / Re: Linux Inventor Says He Doesn’t Believe in Crypto on: July 18, 2024, 12:06:04 PM
If Elon Musk says he will ban iOS on all products of his companies like Tesla if Apple partner with OpenAI at OS level, will people stop using products of Apple?

It makes no sense. Using or not using, it can be from many reasons, budget, security concern and more. Therefore, many people are not fan of any company or OS, they can use products, OS from different companies.

Elon Musk is basically a nobody in the world of operating systems anyway. And in consumer technology in general, I mean he doesn't even know how to run a social media properly to make it scale. He should just stick to cars and rockets if you ask me.

It baffles me why so many "tech people" are seeking his advice for stuff like that, esp. AI. Anybody can spin up a raggedy-tag AI model thanks to LLaMa 3. So just because Elon has an AI company doesn't make him an oracle here in any sense, and certainly not an authority on phones.
19  Other / Politics & Society / Re: Big Tech Founders Abandoning Biden for Trump Because of Tech Support on: July 18, 2024, 11:55:42 AM
I don't think Biden will be able to win against Trump.

Time is running out for him, and I think he will drop out eventually if it means that the new candidate can defeat Trump.

We see all these businesses backing Trump again like in 2016, because those are the entities who have benefited from his policies in his previous term.

It's a bit disappointing, because a rapist/fraudster/and most importantly sell-out of US secrets should not have been able to run for president in the first place.
20  Bitcoin / Bitcoin Discussion / Re: Maelstrom Has Announced $50,000-$150,000 Tech Dev Grant for Bitcoin Developers on: July 18, 2024, 11:52:30 AM
Good for Bitcoin developers.

The grant is worded like it's mostly focusing on Bitcoin Core developers, but it could also benefit other types of developers too, or even just documentation writers.

I wonder what would happen if we ran a grant for Bitcoin Wiki editors.

That would pretty much bring the wiki back to life for example, and should count as improving Bitcoin since Bitcoin Wiki is the reference point for technical data.
Pages: [1] 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 ... 551 »
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!