Bitcoin Forum
April 19, 2024, 12:12:05 AM *
News: Latest Bitcoin Core release: 26.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: How does a site like Blockchain.info know which outputs are change?  (Read 1547 times)
hayek (OP)
Sr. Member
****
Offline Offline

Activity: 370
Merit: 250


View Profile
November 28, 2013, 02:53:38 AM
 #1

I can't find anything on the wiki about transaction change.

I know what it is but how does it appear any different than other outputs?
1713485525
Hero Member
*
Offline Offline

Posts: 1713485525

View Profile Personal Message (Offline)

Ignore
1713485525
Reply with quote  #2

1713485525
Report to moderator
The network tries to produce one block per 10 minutes. It does this by automatically adjusting how difficult it is to produce blocks.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1713485525
Hero Member
*
Offline Offline

Posts: 1713485525

View Profile Personal Message (Offline)

Ignore
1713485525
Reply with quote  #2

1713485525
Report to moderator
jl2012
Legendary
*
Offline Offline

Activity: 1792
Merit: 1092


View Profile
November 28, 2013, 03:32:01 AM
 #2

I can't find anything on the wiki about transaction change.

I know what it is but how does it appear any different than other outputs?

That's just guestimation

Donation address: 374iXxS4BuqFHsEwwxUuH3nvJ69Y7Hqur3 (Bitcoin ONLY)
LRDGENPLYrcTRssGoZrsCT1hngaH3BVkM4 (LTC)
PGP: D3CC 1772 8600 5BB8 FF67 3294 C524 2A1A B393 6517
maaku
Legendary
*
expert
Offline Offline

Activity: 905
Merit: 1011


View Profile
November 28, 2013, 03:35:07 AM
 #3

It makes a wild-ass guess.

I'm an independent developer working on bitcoin-core, making my living off community donations.
If you like my work, please consider donating yourself: 13snZ4ZyCzaL7358SmgvHGC9AxskqumNxP
ivroer
Member
**
Offline Offline

Activity: 89
Merit: 14


View Profile
November 28, 2013, 03:42:12 AM
 #4

That's just guestimation

It makes a wild-ass guess.

Yup, +1 to these 2. It doesn't know it's a guess.

I imagine there's some guessing logic like, have any of the output addresses been seen on the blockchain before? Yes? It might be the actual "spend"... there might be more logic depending on the ratio of amounts to each address.

But it definitely does not get it correct every time, I've seen plenty of transactions (of my own) where the it has estimated incorrectly.
gmaxwell
Moderator
Legendary
*
expert
Offline Offline

Activity: 4158
Merit: 8382



View Profile WWW
November 28, 2013, 06:29:59 AM
 #5

Like a lot of other things on BC.i that are just guesses, its right often enough to confuse people.
maaku
Legendary
*
expert
Offline Offline

Activity: 905
Merit: 1011


View Profile
November 28, 2013, 06:39:05 AM
 #6

If it were me, I'd do prime decomposition on the amounts, calculate their relative magnitude, a boolean value indicating whether they'd been seen before, etc., label a number of training examples, and have a support vector machine generate a classifier.

There's a million other ways you can do it and get decent results. Doesn't stop it from being a WAG though.

I'm an independent developer working on bitcoin-core, making my living off community donations.
If you like my work, please consider donating yourself: 13snZ4ZyCzaL7358SmgvHGC9AxskqumNxP
scintill
Sr. Member
****
Offline Offline

Activity: 448
Merit: 252


View Profile WWW
November 28, 2013, 06:54:40 AM
 #7

If it were me, I'd do prime decomposition on the amounts, calculate their relative magnitude, a boolean value indicating whether they'd been seen before, etc., label a number of training examples, and have a support vector machine generate a classifier.

There's a million other ways you can do it and get decent results. Doesn't stop it from being a WAG though.

In a typical two-output tx created before ~2013-01-30, there's a good chance the first output is the change address.  Maybe even longer, depending on how long until the fix was widely deployed.

1SCiN5kqkAbxxwesKMsH9GvyWnWP5YK2W | donations
maaku
Legendary
*
expert
Offline Offline

Activity: 905
Merit: 1011


View Profile
November 28, 2013, 07:42:23 AM
 #8

Bitcoin-Qt is not the only wallet application...

I'm an independent developer working on bitcoin-core, making my living off community donations.
If you like my work, please consider donating yourself: 13snZ4ZyCzaL7358SmgvHGC9AxskqumNxP
Sukrim
Legendary
*
Offline Offline

Activity: 2618
Merit: 1006


View Profile
November 28, 2013, 02:58:18 PM
 #9

Also if there are e.g. a 3 BTC input and a 3 BTC input to a 4 BTC output and a 1 BTC output, the change is likely the 1 BTC, since there would have been no real need to combine the inputs otherwise.

Still it often guesses wrong, maybe there is some research potential in there somehow?

https://www.coinlend.org <-- automated lending at various exchanges.
https://www.bitfinex.com <-- Trade BTC for other currencies and vice versa.
dserrano5
Legendary
*
Offline Offline

Activity: 1974
Merit: 1029



View Profile
November 28, 2013, 03:23:37 PM
 #10

Plus, from what I've seen, bc.info doesn't bother with transactions having more than two outputs, the estimated amount is always the whole amount in the tx.
TooDumbForBitcoin
Legendary
*
Offline Offline

Activity: 1638
Merit: 1001



View Profile
November 28, 2013, 04:33:39 PM
 #11

Quote
Also if there are e.g. a 3 BTC input and a 3 BTC input to a 4 BTC output and a 1 BTC output, the change is likely the 1 BTC, since there would have been no real need to combine the inputs otherwise.


That's some big fees, or I'm toodumbforbitcoin.



▄▄                                  ▄▄
 ███▄                            ▄███
  ██████                      ██████
   ███████                  ███████
    ███████                ███████
     ███████              ███████
      ███████            ███████
       ███████▄▄      ▄▄███████
        ██████████████████████
         ████████████████████
          ██████████████████
           ████████████████
            ██████████████
             ███████████
              █████████
               ███████
                █████
                 ██
                  █
veil|     PRIVACY    
     WITHOUT COMPROMISE.      
▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂
|   NO ICO. NO PREMINE. 
   X16RT GPU Mining. Fair distribution.  
|      The first Zerocoin-based Cryptocurrency      
   WITH ALWAYS-ON PRIVACY.  
|



                   ▄▄████
              ▄▄████████▌
         ▄▄█████████▀███
    ▄▄██████████▀▀ ▄███▌
▄████████████▀▀  ▄█████
▀▀▀███████▀   ▄███████▌
      ██    ▄█████████
       █  ▄██████████▌
       █  ███████████
       █ ██▀ ▀██████▌
       ██▀     ▀████
                 ▀█▌




   ▄███████
   ████████
   ███▀
   ███
██████████
██████████
   ███
   ███
   ███
   ███
   ███
   ███




     ▄▄█▀▀ ▄▄▄▄▄▄▄▄ ▀▀█▄▄
   ▐██▄▄██████████████▄▄██▌
   ████████████████████████
  ▐████████████████████████▌
  ███████▀▀▀██████▀▀▀███████
 ▐██████     ████     ██████▌
 ███████     ████     ███████
▐████████▄▄▄██████▄▄▄████████▌
▐████████████████████████████▌
 █████▄▄▀▀▀▀██████▀▀▀▀▄▄█████
  ▀▀██████          ██████▀▀
      ▀▀▀            ▀▀▀
drawingthesun
Legendary
*
Offline Offline

Activity: 1176
Merit: 1015


View Profile
November 28, 2013, 04:37:49 PM
 #12

Blockchain.info is very misleading.

The estimated transaction volume is trite, pure utter guess work.

Someone could buy a coffee and it could show up as a $100,000,000 transaction.

Also the IP address stuff is crap too, so misleading. The ip is the node that relays the transaction to the blockchain node and in no way represents where the actual transaction originated from.
TooDumbForBitcoin
Legendary
*
Offline Offline

Activity: 1638
Merit: 1001



View Profile
November 28, 2013, 04:57:16 PM
 #13

Not to mention their 650W/Gh/s electricity consumption nonsense.  Every now and then the MSM picks that up.



▄▄                                  ▄▄
 ███▄                            ▄███
  ██████                      ██████
   ███████                  ███████
    ███████                ███████
     ███████              ███████
      ███████            ███████
       ███████▄▄      ▄▄███████
        ██████████████████████
         ████████████████████
          ██████████████████
           ████████████████
            ██████████████
             ███████████
              █████████
               ███████
                █████
                 ██
                  █
veil|     PRIVACY    
     WITHOUT COMPROMISE.      
▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂
|   NO ICO. NO PREMINE. 
   X16RT GPU Mining. Fair distribution.  
|      The first Zerocoin-based Cryptocurrency      
   WITH ALWAYS-ON PRIVACY.  
|



                   ▄▄████
              ▄▄████████▌
         ▄▄█████████▀███
    ▄▄██████████▀▀ ▄███▌
▄████████████▀▀  ▄█████
▀▀▀███████▀   ▄███████▌
      ██    ▄█████████
       █  ▄██████████▌
       █  ███████████
       █ ██▀ ▀██████▌
       ██▀     ▀████
                 ▀█▌




   ▄███████
   ████████
   ███▀
   ███
██████████
██████████
   ███
   ███
   ███
   ███
   ███
   ███




     ▄▄█▀▀ ▄▄▄▄▄▄▄▄ ▀▀█▄▄
   ▐██▄▄██████████████▄▄██▌
   ████████████████████████
  ▐████████████████████████▌
  ███████▀▀▀██████▀▀▀███████
 ▐██████     ████     ██████▌
 ███████     ████     ███████
▐████████▄▄▄██████▄▄▄████████▌
▐████████████████████████████▌
 █████▄▄▀▀▀▀██████▀▀▀▀▄▄█████
  ▀▀██████          ██████▀▀
      ▀▀▀            ▀▀▀
moderate
Member
**
Offline Offline

Activity: 98
Merit: 10

nearly dead


View Profile
November 28, 2013, 06:38:11 PM
 #14

If it were me, I'd do prime decomposition on the amounts, calculate their relative magnitude, a boolean value indicating whether they'd been seen before, etc., label a number of training examples, and have a support vector machine generate a classifier.

I'm so glad it is not you, that kind of thing is exactly someone fascinated with machine learning would go for. So many thousands and thousands of crap papers where guys blindly go after machine learning -- and it is mostly always svm --, without even considering other methods, reporting results close to 100% accuracy and other metrics just to find out that they don't even know how to setup training/testing sets, neither have a clue about the features they are using.
piuk
Hero Member
*****
expert
Offline Offline

Activity: 910
Merit: 1005



View Profile WWW
November 28, 2013, 07:12:21 PM
Last edit: November 28, 2013, 07:25:53 PM by piuk
 #15

The logic is pretty simple:

- Remove all outputs matching any input addresses.
- If the transaction has one input take the smallest output.
- If a transaction has more than two inputs and exactly two outputs take the output with a value closest to the total input value.
- If a transaction has more than two outputs return the value of the smallest output.

Anyone is welcome to suggest improvements.

If you were really determined the accuracy could be improved by analysing the taint of the inputs used in the next transaction.

maaku
Legendary
*
expert
Offline Offline

Activity: 905
Merit: 1011


View Profile
November 28, 2013, 08:33:04 PM
 #16

If it were me, I'd do prime decomposition on the amounts, calculate their relative magnitude, a boolean value indicating whether they'd been seen before, etc., label a number of training examples, and have a support vector machine generate a classifier.

I'm so glad it is not you, that kind of thing is exactly someone fascinated with machine learning would go for. So many thousands and thousands of crap papers where guys blindly go after machine learning -- and it is mostly always svm --, without even considering other methods, reporting results close to 100% accuracy and other metrics just to find out that they don't even know how to setup training/testing sets, neither have a clue about the features they are using.

Yes, because when faced with a classic machine learning problem, the tried and true techniques of machine learning are not what you'd want to use.

I'm an independent developer working on bitcoin-core, making my living off community donations.
If you like my work, please consider donating yourself: 13snZ4ZyCzaL7358SmgvHGC9AxskqumNxP
scintill
Sr. Member
****
Offline Offline

Activity: 448
Merit: 252


View Profile WWW
November 28, 2013, 11:57:11 PM
 #17

Bitcoin-Qt is not the only wallet application...

Sure, I meant the probability skews a bit.  In practice maybe it doesn't help much.

1SCiN5kqkAbxxwesKMsH9GvyWnWP5YK2W | donations
Sukrim
Legendary
*
Offline Offline

Activity: 2618
Merit: 1006


View Profile
November 29, 2013, 03:09:29 AM
 #18

Quote
Also if there are e.g. a 3 BTC input and a 3 BTC input to a 4 BTC output and a 1 BTC output, the change is likely the 1 BTC, since there would have been no real need to combine the inputs otherwise.


That's some big fees, or I'm toodumbforbitcoin.
Yeah, I meant 2+3BTC inputs, not 3+3... Wink

https://www.coinlend.org <-- automated lending at various exchanges.
https://www.bitfinex.com <-- Trade BTC for other currencies and vice versa.
Remember remember the 5th of November
Legendary
*
Offline Offline

Activity: 1862
Merit: 1011

Reverse engineer from time to time


View Profile
November 29, 2013, 03:44:11 AM
 #19

Blockchain.info is very misleading.

The estimated transaction volume is trite, pure utter guess work.

Someone could buy a coffee and it could show up as a $100,000,000 transaction.

Also the IP address stuff is crap too, so misleading. The ip is the node that relays the transaction to the blockchain node and in no way represents where the actual transaction originated from.
Really? I actually tracked this guy that mined on top of the genesis blocks(orphans duh) using the IP address on the site, and he confirmed it was him.

BTC:1AiCRMxgf1ptVQwx6hDuKMu4f7F27QmJC2
Peter Todd
Legendary
*
expert
Offline Offline

Activity: 1120
Merit: 1149


View Profile
November 29, 2013, 07:50:23 AM
 #20

Blockchain.info is very misleading.

The estimated transaction volume is trite, pure utter guess work.

Someone could buy a coffee and it could show up as a $100,000,000 transaction.

Also the IP address stuff is crap too, so misleading. The ip is the node that relays the transaction to the blockchain node and in no way represents where the actual transaction originated from.
Really? I actually tracked this guy that mined on top of the genesis blocks(orphans duh) using the IP address on the site, and he confirmed it was him.

That's a special case because no other node would have relayed those blocks; in the general case the IP addresses are bullshit.

Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!