Bitcoin Forum
June 22, 2024, 09:42:54 AM *
News: Voting for pizza day contest
 
  Home Help Search Login Register More  
  Show Posts
Pages: [1]
1  Alternate cryptocurrencies / Altcoin Discussion / Re: NVC giveaway !!! Trade it, use it, enjoy it. Only 2nd after BTC by price on: April 04, 2013, 01:12:49 PM
Very generous of you!

4cYFpVHh1QPCcKoBDNLdXAoWUKbhYYPirT

I appreciate!

---yabadaba---
2  Alternate cryptocurrencies / Altcoin Discussion / Re: Ripple Giveaway! on: April 04, 2013, 01:11:33 PM
rJbDT5nx7ENP5vMXmBBuGmGADYNpFpxQrL
3  Other / Beginners & Help / Re: Earn an UNLIMITED number of free bitcoins. on: January 09, 2012, 08:33:15 PM
I believe the word "unlimited" cannot be applied to BTC as they have a hard cap of maximum 21 million. Wink
4  Other / Beginners & Help / Re: My initial Radeon HD 7970 mining benchmarks on: January 09, 2012, 08:32:05 PM
Wait wait wait. Are we sure uint16 is such a good idea? Last time I tried >4 (which was before 2.6, btw, I haven't tested with 2.6), it would crash in the compiler. Also, does anyone have a count on the number of registers per CU? There might not be enough registers to handle that.

I'm not sure if it's a good idea or not so I wanted to measure it Wink GCN has 64KB worth of registers per CU, and like you said I'm not sure if that's enough. The reason for my curiosity was because GCN's compute units each contain 4 x SIMD units with a width of 16 elements (same size as Larrabee & Intel's MIC, coincidentally), and I recall reading somewhere that each of these SIMD units can retire one 16-way instruction every 4 cycles, so those 16element vectors kind of rang out at me. I also wanted to get familiar with the OpenCL bitcoin mining code and thought it would be a neat exercise (which it was!). Nice code by the way.

I can say for sure that 16element vectors DO compile with the drivers that came with the card.

The -ds code dump for 16 element vectors came out nice and clean, although the last few lines where the result is stored in output seem a bit branchy. It looks something like this:

Code:
    if(XG2.s0 == 0x136032ED) { output[Xnonce.s0 & 0xF] = Xnonce.s0; }
    if(XG2.s1 == 0x136032ED) { output[Xnonce.s1 & 0xF] = Xnonce.s1; }
    if(XG2.s2 == 0x136032ED) { output[Xnonce.s2 & 0xF] = Xnonce.s2; }
    ...
    ...
    if(XG2.sd == 0x136032ED) { output[Xnonce.sd & 0xF] = Xnonce.sd; }
    if(XG2.se == 0x136032ED) { output[Xnonce.se & 0xF] = Xnonce.se; }
    if(XG2.sf == 0x136032ED) { output[Xnonce.sf & 0xF] = Xnonce.sf; }

I tried replacing it with a branch-less expression using shuffle() and vstore16() but haven't managed to get it working. What I've come up with looks something like this:

Code:
    x mask = Xnonce & 0xF;
    x temp = shuffle(select(Xnonce, 0, selection), mask);
    vstore16(temp, 0, output);

Anyhow I'm sure that my code modifications are doing all sorts of dumb things. I'm still learning how it all works so please ignore.

Also, check some of the larger -vs, -v 40 is two sets of uint4 and -v 44 does three uint4s (unlike cgminer, -v 4 does two uint2s).

I've tried all of the different -v settings available (according to the source) but haven't been able to get any higher than the 666MH/s with the default settings and 3 compute threads.

The branching has ended up becoming the best outcome. It can evaluate those branches in parallel, and you can't easily optimize away branches for memory writes (and theres apparently like 2 or 3 good tricks to get rid of branch waste, its just none of them work on memory writes).

I should look at shuffle. Your way doesn't quite work though, vstore would output H !=0 hashes, which would trigger HW error alerts (and rightfully so) in the host code, and I consider the HW error tracking important. At least, assuming I'm reading that code right, anyways.

I'll go add official 8 and 16 wide support in a bit, should be useful on, say, AVX if you manually enable CPU mining in the code. SDK 2.6's cpu compiler apparently has gotten a lot better from what I've heard.

So does that mean that is the best for 5870 cards ? Or stick to 2.1 or 2.4 ? I am quite confused as to what the best SDK / ati driver combo is ATM.
5  Other / Beginners & Help / Re: GPU temp on: January 09, 2012, 08:30:11 PM
105 degrees is too damn high LOL.

I always aim to keep it below 80 and ideally at 60 degrees to conserve card lifetime !
6  Other / Beginners & Help / Re: Newbie restrictions on: January 09, 2012, 08:29:12 PM
I think the restrictions are fair so as to be able to prevent spammers and other bad individuals.

Hurray for forum staff and admins !
7  Other / Beginners & Help / Re: Good/best online wallet? on: January 09, 2012, 08:27:25 PM
As much as the 1% transaction fee (max. 1 btc fee) bites, I mainly use StrongCoin for my online wallet, since I keep control of the private keys. I haven't tried blockchain.info yet.

You could probably recommend to the owner of StrongCoin that he implement namecoin usage. Surely the coding is similar enough to be worth the trouble.


I like that service very much !

They cannot run away with the private keys like mybitcoin etc. !
8  Other / Beginners & Help / Re: Bandwidth on: January 09, 2012, 08:26:21 PM
This just shows that we need to make a thin BTC client or at least make the whole thing more scalable :

-too big to download 1.2 GB over internet for some
-too big to store 1.2 GB and growing on HDD
Pages: [1]
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!