Bitcoin Forum

Bitcoin => Mining speculation => Topic started by: organofcorti on November 16, 2012, 07:28:04 AM



Title: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on November 16, 2012, 07:28:04 AM
Below is the original short range forecast post. Long range forecast posts start here (https://bitcointalk.org/index.php?topic=125171.msg1369098#msg1369098).

######

I've written a new post (http://organofcorti.blogspot.com.au/2012/11/102-forecasting-network-hashrate.html) about the MT GOX US$BTC price affecting miner confidence in terms of network hashrate rather than Difficulty. I derive a rather nice model from hints the cross correlation function provided, and try to interpret what model might explain about the nature of the historic network hashrate. I've also made the charts a little smaller.

I hope you find this discovery as interesting as I did.

Quote
In summary:
  • Model 0: log(H) ~ 1.74 + 0.94lag1(log(H)) + 0.21lag1(log(p)) - 0.14lag4(log(p)) . This will be within +/- 15.1% of the actual network hashrate with 95% confidence.
  • All terms of the model are significant.
  • Model 0 tells us that the network hashrate is generally conservative - one week's average will remains similar to the previous week's average, and will see positive or negatve changes due to the weekly median MTGOX US$-BTC price from the previous week and three weeks before that.
  • Model 0 shows that changes in the network hashrate are generally attributable to price; however a volatile market has minor effect on the weekly average network hashrate.
  • A robust longer range forecast model and a more sensitive longer range forecast model will be address in the next post or two.

http://organofcorti.blogspot.com/2012/11/102-forecasting-network-hashrate.html


http://1.bp.blogspot.com/-oYbeCTBuf7I/UKTuR6Wx5iI/AAAAAAAADdM/OfULleYplU4/s1600/npw10.2.chart2.2012-11-16.png


Title: Re: Short range forecast of the network hashrate
Post by: bcpokey on November 16, 2012, 05:39:37 PM
Good post, and kudos for the impressive modelling. It really honed in there.
I was all set to buy some bitcoins when they fell to $10.50ish (I've been out of the game for a while but picking back up interest), but alas around $10.70 they started trending upwards, and are now ~$11.70, so I have no bitcoins to pass along for your fine work  :'(

And I don't say that simply because I got an honorable mention ;)


Title: Re: Short range forecast of the network hashrate
Post by: meowmeowbrowncow on November 16, 2012, 05:49:30 PM


How do this maths?


Title: Re: Short range forecast of the network hashrate
Post by: bitcoindaddy on November 16, 2012, 06:40:47 PM
You lost me starting with the word "I've".  Does this model tell you anything about what will happen to the BTC price when all the ASIC power comes online?


Title: Re: Short range forecast of the network hashrate
Post by: organofcorti on November 16, 2012, 08:04:55 PM
You lost me starting with the word "I've"

Sorry about that. I promise I'll use it less often :)

Does this model tell you anything about what will happen to the BTC price when all the ASIC power comes online?

No, but it could tell you if there's been any sneaky ASIC mining done by a manufacturer.



Title: Re: Short range forecast of the network hashrate
Post by: organofcorti on November 16, 2012, 08:56:26 PM


How do this maths?

The maths in which part? I dont even


Title: Re: Short range forecast of the network hashrate
Post by: organofcorti on November 16, 2012, 09:00:18 PM
Good post, and kudos for the impressive modelling. It really honed in there.
I was all set to buy some bitcoins when they fell to $10.50ish (I've been out of the game for a while but picking back up interest), but alas around $10.70 they started trending upwards, and are now ~$11.70, so I have no bitcoins to pass along for your fine work  :'(
Stupid Wordpress accepting bitcoin now. Look what they did to the exchange rate.

Anyway, there'll be another post or two on this type of model, plenty of time for donations ;)



Title: Re: Short range forecast of the network hashrate
Post by: bitboyben on November 17, 2012, 06:24:53 AM
Looking forward to your next posts. I think you are getting to some real helpful information.
Price does drive difficulty and how it does is clearly demonstrated. Should be easy for miners to make decisions on investing in hardware.


Title: Re: Short range forecast of the network hashrate
Post by: phr33 on November 17, 2012, 10:50:18 AM
Good post, and kudos for the impressive modelling. It really honed in there.
I was all set to buy some bitcoins when they fell to $10.50ish (I've been out of the game for a while but picking back up interest), but alas around $10.70 they started trending upwards, and are now ~$11.70, so I have no bitcoins to pass along for your fine work  :'(
Stupid Wordpress accepting bitcoin now. Look what they did to the exchange rate.

Anyway, there'll be another post or two on this type of model, plenty of time for donations ;)



Hehe, you will have to incorporate that in your model then! :D


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on November 30, 2012, 03:05:51 PM
With molecular's help I've finally finished the post on long range network forecasts of up to four weeks. I'm surprised at how accurate the forecasts have been in recent times, although with the current changes I don't see this lasting long. Anyway, I've provided everything you need to getstarted making you own predictions (and doing your own analysis, should you wish): at:

http://organofcorti.blogspot.com/2012/12/104-long-range-forecasts-of-network.html

I've also posted the numerical forecasts here:

One week forecast (http://bitbin.it/4Hjo93XU)
Two week forecast (http://bitbin.it/Zvv2WZQO)
Three week forecast (http://bitbin.it/1fHc0lF3)
Four week forecast (http://www.bitbin.it/IYjstbCU)
Canary (http://www.bitbin.it/OXax5LUu)


Short except below:

Quote
4. Summary

  • Up to four week forecasts of the network hashrate can be modelled and provide reasonable accuracy
  • The forecast models of two to four weeks do not help us understand the nature of the relationship between the MTGOX US$BTC exchange rate and the network hashrate in the same way the best model (forecast one week) does.


I'll try to update the forecasts when I can, but you now have the necessary tools to do it yourself should you wish. PM me at the bitcointalk.org forums if you need help converting data to a time series to do your own analysis.


http://3.bp.blogspot.com/-U93tlqjtRs8/ULjDBNa3SsI/AAAAAAAAD_g/xNmjqXQ-0DU/s1600/npw10.4.chart4.2012-12-01.png


Title: Re: Long range forecast of the network hashrate
Post by: Dalkore on November 30, 2012, 10:06:33 PM
Short answer:  Higher, much higher.   We will have some calm while ACIS are being readied and deployed but after that, increases in orders of magnitude. 


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on November 30, 2012, 10:33:13 PM
Short answer:  Higher, much higher.   We will have some calm while ACIS are being readied and deployed but after that, increases in orders of magnitude. 

I wrote something much the same in the last two blog posts in the series. I didn't want to mention it again, since it I went over that in detail, but yes with ASICs coming we expect the hashrate to be higher. Once the hashrate is outside the confidence interval for error, we know an influence other than price is affecting the hashrate, compared to historical data.


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on December 04, 2012, 12:47:38 PM
I thought I'd best update this thread with the news that I'm now posting weekly (ish) network forecasts:

http://organofcorti.blogspot.com/2012/12/weekly-network-forecast-3rd-december.html

I'll post regular updates on a different board: https://bitcointalk.org/index.php?topic=129059.0

http://1.bp.blogspot.com/-MZfVrjGQrDs/UL3gZZiZ26I/AAAAAAAAEE4/vs0Si8bEDU4/s1600/Estimates.table.2012-12-04.png

http://4.bp.blogspot.com/-JG42WpPCuqU/UL3jyqYGduI/AAAAAAAAEFw/8pndrzpMwTY/s1600/NetworkForecast.2012-12-04.png




Title: Re: Long range forecast of the network hashrate
Post by: phorensic on December 05, 2012, 03:44:36 AM
http://i897.photobucket.com/albums/ac172/yeti5d/mooninites.jpg


Title: Re: Long range forecast of the network hashrate
Post by: crazyates on December 05, 2012, 05:17:52 AM
I understand about 5% of eacy of your posts, organofcorti. You could be posting graphs about how many cheerios you ate for breakfast that day, and I'd still believe you.


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on December 05, 2012, 05:25:28 AM
I understand about 5% of each of your posts, organofcorti. You could be posting graphs about how many cheerios you ate for breakfast that day, and I'd still believe you.

See, you *did* follow it!

You bring up a good point though. I try to make posts as accessible as possible for a wide audience, without losing too much information. I guess I'm failing at this. The best thing to do is just post something along the lines of "Can you explain or rephrase <whatever it is that you found unintelligible>", either here or the blog. Also, I work two jobs and can only work on posts late at night, so anything you don't follow could be a mistake or a poorly phrased explanation on my part.

For the weekly network forecasts though, the tables are pretty self explanatory, and the date and hashrate at the top of each chart is my forecast for that date as of the posting date.




Title: Re: Long range forecast of the network hashrate
Post by: crazyates on December 05, 2012, 06:12:43 AM
I understand about 5% of each of your posts, organofcorti. You could be posting graphs about how many cheerios you ate for breakfast that day, and I'd still believe you.
See, you *did* follow it!

You bring up a good point though. I try to make posts as accessible as possible for a wide audience, without losing too much information. I guess I'm failing at this. The best thing to do is just post something along the lines of "Can you explain or rephrase <whatever it is that you found unintelligible>", either here or the blog. Also, I work two jobs and can only work on posts late at night, so anything you don't follow could be a mistake or a poorly phrased explanation on my part.

For the weekly network forecasts though, the tables are pretty self explanatory, and the date and hashrate at the top of each chart is my forecast for that date as of the posting date.
Haha I knew it!

Well 5% may have been a bit of an exaggeration. I understand most of your tables/charts/graphs, and really appreciate your projections. I just don't understand quote HOW you got to those conclusions, which to be fair I don't really expect you to educate me if I can't already figure it out. You've already posted quite a bit of information for being as busy as you sound!

Keep up the good work!


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on December 10, 2012, 05:16:35 AM
Update:

Quote
0.Introduction
This week's forecast is interesting, since it shows an influence other than price and hashrate having an effect on the current network hashrate. This influence is of course likely to be the block reward halving reducing the number of miners that find it profitable to mine, and hence recucing the network hashrate.

3. Models and datasets:
The model datasets have been collected into one paste to save time. Model estimates have been likewise aggregated.

Forecast and canary model analysis (http://www.bitbin.it/AmOgKQxD)
All datasets (http://bitbin.it/5OrmAHZ4)
All estimates (http://bitbin.it/Zi3UPVCh)

2. Results
The Canary model shows its value here. The drop in network hashrate is outside the 95% confidence interval for the percentage error; in fact it's outside the 99.9% confidence interval. I had hoped for this result which is an indicator that the Canary model may be useful for detecting when ASICs start to hash in significant numbers.

The Model.f1 prediction for this week is also outside the 95% confidence interval - this was not completely unexpected since the change is significant enough to be outside the expected error range.



Title: Re: Long range forecast of the network hashrate
Post by: 420 on December 11, 2012, 03:23:49 AM
were they right


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on December 11, 2012, 03:58:26 AM
were they right

Were who right? If you mean Dalkore, then no, he wasn't right. My estimates were much closer.


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on December 24, 2012, 05:51:39 AM
0.Introduction
It could be suggested that the reward halving continues to produce a significant effect outside of the direct USDBTC comparison, and certainly that's what I intuit. I suspect only a doubling of the MTGOX US$BTC price or a halving of the network hashrate will bring the model on track, but time will tell.


1. Models and datasets:
The model datasets have been collected into one paste to save time. Model estimates have been likewise aggregated.

Forecast and canary model analysis (http://www.bitbin.it/AmOgKQxD)
All datasets (http://www.bitbin.it/2VbuQ3sW)
All estimates (http://www.bitbin.it/EQsRli2I)

2. Results
Model.f1, the one week forecast, recovered to within the 95% confidence interval range for last weeks's forecast of this week. I think this shows the importance of including the network hashrate lag variables. As somewhat expected the Canary model is still modelling a hashrate much higher than the current network hashrate, which is outside the 95% confidence interval for that model for the third week in a row.

http://organofcorti.blogspot.com/2012/12/weekly-network-forecast-24th-december.html






Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on January 05, 2013, 07:59:40 AM
This week's update, report from the blog:

http://organofcorti.blogspot.com.au/2013/01/weekly-network-forecast-31st-december.html

Weekly network forecast 31st December 2012

0.Introduction
I apologise for the delay in posting this update. The real world intervened in a most pleasant manner.

I've added some more error information this week to enable an easier comparison between forecast and actual network hashrate. The forecast models are recovering already, with all predictions within the estimated error range. It seems the reward halving has only had a temporary effect on the forecast model accuracy so far.

1. Models and datasets:
The model datasets have been collected into one paste to save time. Model estimates have been likewise aggregated.

Forecast and canary model analysis (http://www.bitbin.it/AmOgKQxD)
All datasets (http://www.bitbin.it/xMHgQypr)
All estimates (http://www.bitbin.it/M3tPDEaV)

2. Results
  • Canary model (current hashrate estimate based only on current price and previous network hashrates): This model's error has recovered to 4% of the actual weekly average network hashrate this week. The Canary model was outside the expected range for only 3 weeks after the reward halving, so it seems to remain a good indicator of the onset of changes other than the MTGOX BTCUSD price and previous network hashrate averages.
  • Model.f1 (one week forecast): Model.f1's error recovered to 5% of the weekly average network hashrate, almost as low an average error as in the weeks leading up to the reward halving. Hopefully the model will remain useful until the ASIC hashrates are added.
  • Models f2, f3 and f4 errors are high, and although within the estimated 95% confidence interval for the error are not yet useful for long range forecasts.



Donations help give me the time to analyse bitcoin mining related issues and write these posts. If you enjoy or find them helpful, please consider a small bitcoin donation:
12QxPHEuxDrs7mCyGSx1iVSozTwtquDB3r

Thanks to the following for use of their data:
blockexplorer.com:  1Cvvr8AsCfbbVQ2xoWiFD1Gb2VRbGsEf28
blockchain.info
molecular: 1MoLECHau3nb9zV78uKraNdcZsJ2REwxuL


Title: Re: Long range forecast of the network hashrate
Post by: 420 on January 06, 2013, 01:15:55 AM
estimate without asic introduciton... ? right :)


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on January 06, 2013, 03:19:33 AM
estimate without asic introduciton... ? right :)

Yes. As mentioned previously, the method breaks down when changes to the network are effected by causative factors other than price or the previous network hashrate. That's why the forecasts were so inaccurate after the reward halving. I'm surprised the forecasts are already becoming accurate again so soon.

This could help pinpoint when the first ASICs start hashing.

More details:

https://bitcointalk.org/index.php?topic=127379.0
http://organofcorti.blogspot.com/2012/11/103-canaries-coal-mines-and-black-swans.html


Title: Re: Long range forecast of the network hashrate
Post by: Raize on January 07, 2013, 06:03:36 PM
Regardless of whether or not ASICs drop and change the way modelling estimates are done, someone needs to be doing this so those of us both trading and mining can have some sort of foresight. This is impressive work, organofcorti. Thanks for doing this.


Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on January 07, 2013, 10:23:10 PM
Regardless of whether or not ASICs drop and change the way modelling estimates are done, someone needs to be doing this so those of us both trading and mining can have some sort of foresight. This is impressive work, organofcorti. Thanks for doing this.

I'm glad you find it useful - I was thinking the simplicity and accuracy of the models were interesting enough to follow up with a weekly update, but I wasn't sure anyone would be able to use the data given the way the errors can increase as the length of the forecast increases.
 



Title: Re: Long range forecast of the network hashrate
Post by: organofcorti on January 08, 2013, 08:10:31 AM
This week's update, report from the blog:

http://organofcorti.blogspot.com/2013/01/weekly-network-forecast-7th-january-2013.html

Note: The charts are now taking up so much room I'm no longer posting them here; you can see them in all their glory at the blog (http://organofcorti.blogspot.com/2013/01/weekly-network-forecast-7th-january-2013.html).

Weekly network forecast 7th January 2013

0.Introduction
If you're happy with < 10% error in your forecasts, this week's errors were acceptably useful. Since (barring the changes due to the reward halving) accuracy has been generally good,  it's possible to use the weekly hashrate forecasts to estimate the date of a retarget and a forecast estimate for the retarget difficulty.

For example based on this weeks model f1, f2, f3 and f4 weekly hashrate forecasts, we can forecast the retarget at block 217728 will be on 21st January and difficulty will change to ~ 3657934.

So this for this week I've included:
  • A table of current retarget date and difficulty estimates
  • A table of previous retarget date and difficulty estimates, and the actual retarget dates and difficulties as a comparison
  • A chart of the last twenty six weeks of retarget date and difficulty estimates, actual retarget dates and actual difficulties.

It should be noted that each retarget will often be forecast by two consecutive weekly forecasts, hence the multiple points per retarget on the new chart. Please post a comment if it's not clear.


1. Models and datasets:
The model datasets have been collected into one paste to save time. Model estimates have been likewise aggregated.

Forecast and canary model analysis (http://www.bitbin.it/AmOgKQxD)
All datasets (http://www.bitbin.it/gSkamvYj)
All estimates (http://www.bitbin.it/6hQ3o1qO)
Difficulty data and estimates (http://www.bitbin.it/tkfGZSkD)

2. Results
  • Canary model (current hashrate estimate based only on current price and previous network hashrates): This model's error has recovered to 4% of the actual weekly average network hashrate this week. The Canary model was outside the expected range for only 3 weeks after the reward halving, so it seems to remain a good indicator of the onset of changes other than the MTGOX BTCUSD price and previous network hashrate averages.
  • Model.f1 (one week forecast): Model.f1's error recovered to 5% of the weekly average network hashrate, almost as low an average error as in the weeks leading up to the reward halving. Hopefully the model will remain useful until the ASIC hashrates are added.
  • Models f2, f3 and f4 errors are high, and although within the estimated 95% confidence interval for the error are not yet useful for long range forecasts.
  • The large negative difficulty change after the block reward halving was (of course) not predicted and stands out as a clear error. However the new estimates look on target, and I think an estimate of Difficulty ~ 3.6 million after the next retarget is reasonable.




Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: qbits on January 11, 2013, 07:13:40 AM
Below is the original short range forecast post. Long range forecast posts start here (https://bitcointalk.org/index.php?topic=125171.msg1369098#msg1369098).

  • Model 0: log(H) ~ 1.74 + 0.94lag1(log(H)) + 0.21lag1(log(p)) - 0.14lag4(log(p)) . This will be within +/- 15.1% of the actual network hashrate with 95% confidence.

You are modeling/training your forecast function using the same data that you use to verify it's forecast accuracy. This is not a proper way to create a data model as all you get is a function that is able to accurately model past price movements and you have no information on it's accuracy.

Proper way is to split the historical pricing data into at least two sets:
- learning set, from which you deduce your function parameters. You can use jan-jun 2012 price data for example.
- test set, which you can use to verify function accuracy, that is how successfully it models price data. You can use jul-dec 2012 for example.
- possibly you could have more test sets, 2011 data for example.

If you were to do what I suggested you would find that your model is not very good at modeling price data and hence a very bad predictor/forcaster of future data.

This is not your fault, it's just that you have taken on a very very difficult problem to solve...



Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on January 11, 2013, 07:41:56 AM
Below is the original short range forecast post. Long range forecast posts start here (https://bitcointalk.org/index.php?topic=125171.msg1369098#msg1369098).

  • Model 0: log(H) ~ 1.74 + 0.94lag1(log(H)) + 0.21lag1(log(p)) - 0.14lag4(log(p)) . This will be within +/- 15.1% of the actual network hashrate with 95% confidence.

You are modeling/training your forecast function using the same data that you use to verify it's forecast accuracy. This is not a proper way to create a data model as all you get is a function that is able to accurately model past price movements and you have no information on it's accuracy

Proper way is to split the historical pricing data into at least two sets:
- learning set, from which you deduce your function parameters. You can use jan-jun 2012 price data for example.
- test set, which you can use to verify function accuracy, that is how successfully it models price data. You can use jul-dec 2012 for example.
- possibly you could have more test sets, 2011 data for example.

Thanks for your feedback.

I've always been aware that I might have been overfitting. I couldn't split the data - there's just so little data to be had. Also, various points in time have had slightly differing auto - and cross correlations and I wanted to be able to produce a simple linear model that could account for all data, especially since I'm only using two variables out of all the variables that can affect the network hashrate from time to time.

Instead I decided to use all the data I had, using a linear model with minimum of coefficients and lagged variables. Since then I have applied the model weekly. Imagine the initial post as the "training" phase and my current posts as the "test set". It seems so far that the model has been predicting future network hashrate - and future network mining difficulty - far better than I expected.

Also, it seems you're assuming I'm modelling price. I'm not modelling price data - I'm using lagged historical price data (and lagged historical network hashrate data) to provide a forecast of the network hashrate. There is no modelling of price at all, just a 1 to 4 week forecast of the network hashrate, with confidence intervals for the error (which I'll be the first to admit are quite large at the 3 and 4 week forecast).

If you were to do what I suggested you would find that your model is not very good at modeling price data and hence a very bad predictor/forcaster of future data.

This is not your fault, it's just that you have taken on a very very difficult problem to solve...

However, as I mentioned I'm not modelling price data. I've used genetic algorithms to do that before using the ADF test (http://en.wikipedia.org/wiki/Augmented_Dickey–Fuller_test) with some success - but over time it was not enough to beat the ask/bid spread. So I'll leave that to the finance geeks ;)

If you look at my weekly update posts, you'll see I'm assessing the models as I go - not changing them, just assessing them. So far they have performed as expected - within the 95% CI for error, except when the reward halving occurred ( a variable I cannot account for in a simple linear/lag function ).

If I haven't explained this well, take a look at the long range forecast post:
http://organofcorti.blogspot.com/2012/12/104-long-range-forecasts-of-network.html

and the most recent update post:
http://organofcorti.blogspot.com/2013/01/weekly-network-forecast-7th-january-2013.html

I would be interested to read what you think.


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: qbits on January 11, 2013, 11:08:27 PM
First let me correct what I said in the earlier post when I spoke of price history data: I meant difficulty history data. Indeed you are modeling difficulty and not price I must have confused the two :)

I read your report and found this: http://2.bp.blogspot.com/-YgYQ-OX3S5E/UOvLIj9-6SI/AAAAAAAAEoQ/drCPQ70jf7U/s1600/DifficultyForecast.2013-01-08.png

As soon as you try to forcast data into any significant future, your model is way off real data.

Anyway splitting data into learning and test sets is a must. That's the only way to test the validity of the model with any confidence.

Another suggestion would be to try other modelling techniques such as neural networks. You can find information about all these in books covering Machine Learning...







Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on January 12, 2013, 02:00:14 PM
First let me correct what I said in the earlier post when I spoke of price history data: I meant difficulty history data. Indeed you are modeling difficulty and not price I must have confused the two :)

I read your report and found this: http://2.bp.blogspot.com/-YgYQ-OX3S5E/UOvLIj9-6SI/AAAAAAAAEoQ/drCPQ70jf7U/s1600/DifficultyForecast.2013-01-08.png

As soon as you try to forcast data into any significant future, your model is way off real data.


As mentioned in the post, I expected the model to fail when the block reward halving occurred. In the chart to which you link, I asume you refer to the inability of the network hashrate models to predict difficulty retargets correctly? I think it's done a rather good job. The only prediction that has been significantly different from the actual difficulty retarget is the one immediately after the reward halving. The other difficulty predictions (after making the model in November) have been better than I expected, since I'm not modelling it directly and can't provide a confidence interval.

Anyway splitting data into learning and test sets is a must. That's the only way to test the validity of the model with any confidence.

Can you provide a source for this assertion? I've found no books on ARIMA modelling that make such a claim - that training and testing sets are necessary for forecasts of auto- and cross-correlative models. These sets might be necessary when using symbolic regression and (sometimes) when using genetic algorithms to model time series data, but it's not mentioned in the ARIMA texts I've read.

Do keep in mind that in (for example) a univariate series of one hundred weekly average hashrate data points, we have an unknown number of known independent variables (historical price and hashrate data) and a unknown number of unknown independent variables. Some of those unknown variables have been / will be rare but have had / will have a significant effect on the network hashrate which cannot be accounted for by the model. Splitting the data into two smaller sets risks the unknown variables having a significant effect on the accuracy of the model.

Another suggestion would be to try other modelling techniques such as neural networks. You can find information about all these in books covering Machine Learning...

That could be fun, but I'm not trying to provide the most accurate forecast possible - I'm using the simplest possible method to achieve an aim, explain how it's done and hopefully interest some readers to try it for themselves - or go one step further. Anyone who has a basic level of math and coding skills should be able to replicate my work and at the same time know what they're doing. If my work encourages someone to develop a truly accurate forecast, I'll be very happy.

In the meantime, I provide 95% confidence intervals for the hashrate forecasts based on historical data and so far the forecasts have only exceeded the confidence interval only when an unknown variable (the reward halving) has an effect on the network. I'm surprised by the model's rapid recovery post the block halving.

We'll just have to see how it goes. Care to make a wager? ;)


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: qbits on January 12, 2013, 02:27:45 PM

Can you provide a source for this assertion? I've found no books on ARIMA modelling that make such a claim - that training and testing sets are necessary for forecasts of auto- and cross-correlative models. These sets might be necessary when using symbolic regression and (sometimes) when using genetic algorithms to model time series data, but it's not mentioned in the ARIMA texts I've read.

In the meantime, I provide 95% confidence intervals for the hashrate forecasts based on historical data and so far the forecasts have only exceeded the confidence interval only when an unknown variable (the reward halving) has an effect on the network. I'm surprised by the model's rapid recovery post the block halving.

We'll just have to see how it goes. Care to make a wager? ;)


ad 1. start here: http://en.wikipedia.org/wiki/Cross-validation_(statistics). Father Google will provide further references + book recommendations

ad 2. how far into the future are you forecasting?

ad 3. sure! I'll bet you 1 BTC :)


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on January 14, 2013, 06:27:21 AM
ad 1. start here: http://en.wikipedia.org/wiki/Cross-validation_(statistics). Father Google will provide further references + book recommendations
No mention that it's the only way to test the validity of a model with confidence, just that it is one method.

ad 2. how far into the future are you forecasting?
Model.f1 forecasts 1 week ahead
Model.f2 forecasts 2 weeks ahead
Model.f3 forecasts 3 weeks ahead
Model.f4 forecasts 4 weeks ahead
The Canary model attempts to detect effects of unknown variables.

ad 3. sure! I'll bet you 1 BTC :)

OK - I'll bet 1 btc that for the next 5 weeks the actual weekly network average hashrate will be within 13% of the Model.f1 prediction, unless the canary model indicates an external (non hashrate or price) influence.

Now, who will be third party escrow? :)


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on January 14, 2013, 08:50:28 AM
Note: The charts are now taking up so much room I'm no longer posting them here; you can see them in all their glory at the blog (http://organofcorti.blogspot.com/2013/01/weekly-network-forecast-14th-january.html).

Weekly network forecast 14th January 2013

0.Introduction
The Canary model error for this week turns out to be more than the 95% confidence interval for error, and well outside the 95% confidence interval for the network hashrate estimate. This  possibly implies the effect of an external influence, or more likely due to the 3.2% increase in price coupled with an unexpected 7.7% decrease in the weekly average network hashrate.
  • If there is some sort of external influence which has caused some proportion of miners to switch off suddenly, then the model recovery will likely be slow, as it was after the block reward halving.
  • If the drop in hashrate with an increase in price is a random event, models should recover to within the expected range next week.
  • If the last two weeks were anomalous and the block reward halving is having a continued effect on the network, then model error will continue to be outside the 95% confidence interval for error an unknown period of time.



Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: tacotime on January 18, 2013, 03:42:24 AM
I'm guessing that the 20 TH/s brought by Avalon will have the network hash rate up to 40 TH/s in a week or two, since anyone who has one will be mining with it.


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on March 25, 2013, 10:29:50 AM
I haven't updated this for a while, so here are the most recent charts:

http://organofcorti.blogspot.com/2013/03/weekly-network-forecast-25th-march-2013.html

http://2.bp.blogspot.com/-Mw7v6G66FrI/UVAMBIdasJI/AAAAAAAAHBs/1d_h8lMvCds/s1600/NetworkForecast.2013-03-25.png

http://3.bp.blogspot.com/-A3-_lA72ck4/UVAL9bqlDQI/AAAAAAAAHAw/ZMDR2kpxHL8/s1600/Canary.2013-03-25.png

http://2.bp.blogspot.com/-SmrrWXiljjM/UVAL-9hqduI/AAAAAAAAHBQ/zj7TuoI6GiQ/s1600/DifficultyForecast.2013-03-25.png

http://1.bp.blogspot.com/-YtABCa2FPOQ/UVAL_LQmR9I/AAAAAAAAHBU/RHaXe_MAkh8/s1600/DifficultyForecast2.2013-03-25.png



Title: Re: Long range forecast of the network hashrate
Post by: Dalkore on March 25, 2013, 04:39:19 PM
were they right

Were who right? If you mean Dalkore, then no, he wasn't right. My estimates were much closer.

Yes, please take his estimates.  I was just giving commentary.  I follow his work and he is really working on refining his numbers. 


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: creativex on March 26, 2013, 02:32:47 PM
Fantastic work organofcorti. Thanks for the update.

Tip inbound.


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: organofcorti on March 26, 2013, 02:49:38 PM
Fantastic work organofcorti. Thanks for the update.

Tip inbound.

Thank you very much, creativex. And as a special "thank you" treat, you get:

http://2.bp.blogspot.com/-Mw7v6G66FrI/UVAMBIdasJI/AAAAAAAAHBs/1d_h8lMvCds/s1600/NetworkForecast.2013-03-25.png (http://organofcorti.blogspot.com/2013/03/weekly-network-forecast-25th-march-2013.html)


Title: Re: Long range forecast of the network hashrate and Difficulty
Post by: Rawted on March 27, 2013, 06:07:37 AM
i'm late to the party, but this is fantastic work.