I am parsing by regular expression transactions from BitcoinCore base.
trying to get all used bitcoin addresses
Pattern "(1|bc1)[a-km-zA-HJ-NP-Z1-9]{25,34}[^A-Za-z0-9\\p{P}]"
start 1 or bc1, the correct set of letters in the middle, long, and ending with anything other than a letter and a number to cut from the text
the result is
185EVzhyGhS6guJ8qmxddAqKxz4ZnH7xia
1DTdkgwGugz5cRGgScamGRaQrYv8UoYMoU
179f29fc9125d44855d23833aa1b5c5 ---------------
1B3doy3pzzT5CoNxz3euZMqtEkksfwx3hY
1BeysPkUXr8MYbhQiDvbypc6jBrentMRCj
1jmmV7fwJsppj5GRSXeMFo79v5gPtx --------------
1YpkqaE1TJLYbHseGxWowMrrCNJDLMNY ------------
17146647f7a4992bb61b227c271843d1 --------------
1NpLfY5VRiEmk6e65NhnK6SdyfNx2TgbwV
1gmohVXc1A5GGPHsrGhPzN84sPUWhL ----------------
128U9yEHB9zJqzEohBTkySPNNzjPJAkf8X
1usVaTvJUXdue6B6A6jAFGg7u8bRgh9bG --------------
1E9gfEykFQXGgeE2RKa3hjJfdSv2Q5SEY4
"--------" these are NOT bitcoin addresses, although the pattern matches
the crux of the problem: how to distinguish whether a string is a bitcoin address or not? I can't check on the site. a lot of them)
maybe you can count the sum of letters or something like that or change the pattern?