Bitcoin Forum
April 19, 2024, 04:09:24 PM *
News: Latest Bitcoin Core release: 26.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 [2]  All
  Print  
Author Topic: A slightly more open approach to bitcoin on the GPU  (Read 18951 times)
Syke
Legendary
*
Offline Offline

Activity: 3878
Merit: 1193


View Profile
September 17, 2010, 01:29:03 AM
 #21

nelisky, I'll provide a card. Check your PMs.

Buy & Hold
1713542964
Hero Member
*
Offline Offline

Posts: 1713542964

View Profile Personal Message (Offline)

Ignore
1713542964
Reply with quote  #2

1713542964
Report to moderator
If you want to be a moderator, report many posts with accuracy. You will be noticed.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1713542964
Hero Member
*
Offline Offline

Posts: 1713542964

View Profile Personal Message (Offline)

Ignore
1713542964
Reply with quote  #2

1713542964
Report to moderator
mizerydearia
Hero Member
*****
Offline Offline

Activity: 574
Merit: 507



View Profile
September 17, 2010, 06:47:40 AM
 #22

Everyone developing this GPU thing seems to have an agenda.

Maybe noagendamarket should do it! ^_^
puddinpop
Member
**
Offline Offline

Activity: 103
Merit: 17


View Profile
September 22, 2010, 09:42:48 PM
 #23

I just tried your latest patch.  I noticed a few things:

  • Your post, and the patch name, makes it seem like the patch is against revision 157, which does not yet exist.  Upon looking at the content of the patch, it is obvious the patch is against revision 155.
  • You have an extraneous curly brace on line ~3077 when FOURWAYSSE2 is not defined
  • I get about 6200 khash/s with your patch using the GPU only (limit set to 1 CPU).  However without the CPU limit, and using 2 CPUs, I only get 6500 khash/s according to the counter.  That can't be right.

nelisky (OP)
Legendary
*
Offline Offline

Activity: 1540
Merit: 1001


View Profile
September 22, 2010, 10:02:03 PM
 #24

I just tried your latest patch.  I noticed a few things:

  • Your post, and the patch name, makes it seem like the patch is against revision 157, which does not yet exist.  Upon looking at the content of the patch, it is obvious the patch is against revision 155.
  • You have an extraneous curly brace on line ~3077 when FOURWAYSSE2 is not defined
  • I get about 6200 khash/s with your patch using the GPU only (limit set to 1 CPU).  However without the CPU limit, and using 2 CPUs, I only get 6500 khash/s according to the counter.  That can't be right.

Thanks for trying it out, finally some feedback Smiley

You are correct about the revision number being wrong. It was actually against r154. The fact I did a hacky job at including the miner means that problems do exist. I find that using all the cores + the GPU actually slows everything, and that reducing the number of mining threads on the fly doesn't do the right thing. One has to, as best as I can understand, start with 1 and up to 2 or more, but reducing threads leaves the system in unexpected states.

As for using a limit of -1 slowing things down, it makes some sense, as the CPU still is at 100% when using the GPU, doing memcpy's and finding nonces that are potential winners.

I'm attaching a new version that unrolls pretty much everything and nets me an extra 1MH/s, but this is finely tuned for my own system. This comes from a slightly more patched code base, so I hope I didn't leave any extra stuff.
nelisky (OP)
Legendary
*
Offline Offline

Activity: 1540
Merit: 1001


View Profile
September 22, 2010, 10:14:46 PM
 #25

I just tried your latest patch.  I noticed a few things:

  • Your post, and the patch name, makes it seem like the patch is against revision 157, which does not yet exist.  Upon looking at the content of the patch, it is obvious the patch is against revision 155.
  • You have an extraneous curly brace on line ~3077 when FOURWAYSSE2 is not defined
  • I get about 6200 khash/s with your patch using the GPU only (limit set to 1 CPU).  However without the CPU limit, and using 2 CPUs, I only get 6500 khash/s according to the counter.  That can't be right.

Hey puddinpop, I'm curious. On what OS are you compiling this? If it's not OSX, care to share the makefile? Also, what is you graphics card and how does my patch compare to your version on it? I got about 20% increase on mine compared to yours, but then again some things like the threads per block or the total number of blocks have a huge impact on the overal performance, and I didn't try to optimize your version.
nelisky (OP)
Legendary
*
Offline Offline

Activity: 1540
Merit: 1001


View Profile
September 27, 2010, 02:55:22 PM
 #26

I just tried your latest patch.  I noticed a few things:

  • Your post, and the patch name, makes it seem like the patch is against revision 157, which does not yet exist.  Upon looking at the content of the patch, it is obvious the patch is against revision 155.
  • You have an extraneous curly brace on line ~3077 when FOURWAYSSE2 is not defined
  • I get about 6200 khash/s with your patch using the GPU only (limit set to 1 CPU).  However without the CPU limit, and using 2 CPUs, I only get 6500 khash/s according to the counter.  That can't be right.

Hey puddinpop, I'm curious. On what OS are you compiling this? If it's not OSX, care to share the makefile? Also, what is you graphics card and how does my patch compare to your version on it? I got about 20% increase on mine compared to yours, but then again some things like the threads per block or the total number of blocks have a huge impact on the overal performance, and I didn't try to optimize your version.

I'm sure puddinpop has good reasons to not reply to my questions, but it feels weird. It's almost like he has something against me. Do you, puddinpop? Can't we just be friends? Wink

Anyhow, no joke, I have generated my first real block using the CUDA miner! Woohoo, 50 coins accounted for, *now* it was all worth it, the sleepless nights, the complaining family due to my lack of attention, everything!

If I get another block I may start developing an opencl version, while I'm in profit! That is, if puddinpop didn't make one first and sold it for 20k coins
GeorgeH
Member
**
Offline Offline

Activity: 83
Merit: 10


View Profile
October 07, 2010, 02:57:09 PM
 #27

Has anyone built this for windows?

1DSpPtPTGXTYjkZehPsiAbjkXLkB1jsZ2x
em3rgentOrdr
Sr. Member
****
Offline Offline

Activity: 434
Merit: 251


youtube.com/ericfontainejazz now accepts bitcoin


View Profile WWW
October 19, 2010, 08:40:03 PM
 #28

I just tried your latest patch.  I noticed a few things:

  • Your post, and the patch name, makes it seem like the patch is against revision 157, which does not yet exist.  Upon looking at the content of the patch, it is obvious the patch is against revision 155.
  • You have an extraneous curly brace on line ~3077 when FOURWAYSSE2 is not defined
  • I get about 6200 khash/s with your patch using the GPU only (limit set to 1 CPU).  However without the CPU limit, and using 2 CPUs, I only get 6500 khash/s according to the counter.  That can't be right.

Hey puddinpop, I'm curious. On what OS are you compiling this? If it's not OSX, care to share the makefile? Also, what is you graphics card and how does my patch compare to your version on it? I got about 20% increase on mine compared to yours, but then again some things like the threads per block or the total number of blocks have a huge impact on the overal performance, and I didn't try to optimize your version.

I'm sure puddinpop has good reasons to not reply to my questions, but it feels weird. It's almost like he has something against me. Do you, puddinpop? Can't we just be friends? Wink

Anyhow, no joke, I have generated my first real block using the CUDA miner! Woohoo, 50 coins accounted for, *now* it was all worth it, the sleepless nights, the complaining family due to my lack of attention, everything!

If I get another block I may start developing an opencl version, while I'm in profit! That is, if puddinpop didn't make one first and sold it for 20k coins

Nelisky (or anyone else), could you provide the makefile and what system you compiled this on.  I'm really having trouble compiling this.  I have Ubuntu 10.04 with a GFX 9800 Gx2 with the latest Nvidia CUDA drivers installed (and I verified that my CUDA test programs do work). 

If no one wants to provide the makefile, could someone explains how I would go about modifying the existing unix makefile to compile the cuda version?

Also, what specific SVN reversion are we patching against?

"We will not find a solution to political problems in cryptography, but we can win a major battle in the arms race and gain a new territory of freedom for several years.

Governments are good at cutting off the heads of a centrally controlled networks, but pure P2P networks are holding their own."
nelisky (OP)
Legendary
*
Offline Offline

Activity: 1540
Merit: 1001


View Profile
October 19, 2010, 08:48:38 PM
 #29


Nelisky (or anyone else), could you provide the makefile and what system you compiled this on.  I'm really having trouble compiling this.  I have Ubuntu 10.04 with a GFX 9800 Gx2 with the latest Nvidia CUDA drivers installed (and I verified that my CUDA test programs do work). 

If no one wants to provide the makefile, could someone explains how I would go about modifying the existing unix makefile to compile the cuda version?

Also, what specific SVN reversion are we patching against?

It's been a while since I last worked on this, but I have compiled on OSX, against revision 156 and the osx makefile changes are on the patch a few messages above. You will probably need to change the paths to the libs, but should be fairly simple to do so.
LZ
Legendary
*
Offline Offline

Activity: 1722
Merit: 1072


P2P Cryptocurrency


View Profile
January 21, 2011, 05:47:00 PM
 #30

nelisky, can you make r205 patch? Smiley

My OpenPGP fingerprint: 5099EB8C0F2E68C63B4ECBB9A9D0993E04143362
nelisky (OP)
Legendary
*
Offline Offline

Activity: 1540
Merit: 1001


View Profile
January 21, 2011, 06:32:13 PM
 #31

nelisky, can you make r205 patch? Smiley

Not right now, no, I lack the time. But I'm curious, why not use one of the much more optimized by now OpenCL implementations?
LZ
Legendary
*
Offline Offline

Activity: 1722
Merit: 1072


P2P Cryptocurrency


View Profile
January 21, 2011, 06:36:10 PM
 #32

Our strength is in the diversity of solutions. Sometimes your miner is the optimal solution. Smiley

My OpenPGP fingerprint: 5099EB8C0F2E68C63B4ECBB9A9D0993E04143362
Pages: « 1 [2]  All
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!