Bitcoin Forum
April 24, 2024, 06:25:09 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: 1 2 3 [All]
  Print  
Author Topic: CUDA Optimalized BTC miner for NVIDIA cards  (Read 7694 times)
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 25, 2013, 02:52:42 PM
Last edit: May 26, 2013, 06:23:51 AM by icedev576
 #1

I`ve started to optimalize the rpc-miner CUDA version. Currently it runs on Linux (Ubuntu 12.04), and tested on 2 different cards:
Asus GTX-590: 260 MH/s (no OC)
Asus GTX-690: 230 MH/s (no OC)
It is about 15-30% faster then the original.
I started this topic to discuss is it worth it to continue this project or no one interested in...

I am intersted in your current hash rates on newer nvidia cards also.

I accept donations for this project in the following BTC address: 1GgnBSzLq6DioYS4haPSx6gZCeL9KMUYFW.

It will be opensource.
1713939909
Hero Member
*
Offline Offline

Posts: 1713939909

View Profile Personal Message (Offline)

Ignore
1713939909
Reply with quote  #2

1713939909
Report to moderator
1713939909
Hero Member
*
Offline Offline

Posts: 1713939909

View Profile Personal Message (Offline)

Ignore
1713939909
Reply with quote  #2

1713939909
Report to moderator
"In a nutshell, the network works like a distributed timestamp server, stamping the first transaction to spend a coin. It takes advantage of the nature of information being easy to spread but hard to stifle." -- Satoshi
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1713939909
Hero Member
*
Offline Offline

Posts: 1713939909

View Profile Personal Message (Offline)

Ignore
1713939909
Reply with quote  #2

1713939909
Report to moderator
1713939909
Hero Member
*
Offline Offline

Posts: 1713939909

View Profile Personal Message (Offline)

Ignore
1713939909
Reply with quote  #2

1713939909
Report to moderator
1713939909
Hero Member
*
Offline Offline

Posts: 1713939909

View Profile Personal Message (Offline)

Ignore
1713939909
Reply with quote  #2

1713939909
Report to moderator
gaborjakab
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
May 26, 2013, 06:37:29 AM
 #2

Hi!

I have an overclocked 680 and have only 135 Mhash/s. I would love to see if your modification helps a bit. By the way I am using on older Ubuntu. When will it be available?
roy7
Sr. Member
****
Offline Offline

Activity: 434
Merit: 250


View Profile
May 26, 2013, 06:48:09 AM
 #3

You might find these helpful

https://bitcointalk.org/index.php?topic=163750.0
https://bitcointalk.org/index.php?topic=167229.0
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 26, 2013, 07:44:26 AM
 #4

This is interesting. I am reading the post, maybe I can mix the solutions together.
klintay
Legendary
*
Offline Offline

Activity: 1775
Merit: 1032


Value will be measured in sats


View Profile WWW
May 26, 2013, 08:08:32 AM
 #5

yeah i would be interest too in this!
smoothie
Legendary
*
Offline Offline

Activity: 2492
Merit: 1473


LEALANA Bitcoin Grim Reaper


View Profile
May 26, 2013, 08:44:51 AM
 #6

Hopefully this hasn't already been optimized. But if so, good job.

███████████████████████████████████████

            ,╓p@@███████@╗╖,           
        ,p████████████████████N,       
      d█████████████████████████b     
    d██████████████████████████████æ   
  ,████²█████████████████████████████, 
 ,█████  ╙████████████████████╨  █████y
 ██████    `████████████████`    ██████
║██████       Ñ███████████`      ███████
███████         ╩██████Ñ         ███████
███████    ▐▄     ²██╩     a▌    ███████
╢██████    ▐▓█▄          ▄█▓▌    ███████
 ██████    ▐▓▓▓▓▌,     ▄█▓▓▓▌    ██████─
           ▐▓▓▓▓▓▓█,,▄▓▓▓▓▓▓▌          
           ▐▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▌          
    ▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓─  
     ²▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓╩    
        ▀▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▀       
           ²▀▀▓▓▓▓▓▓▓▓▓▓▓▓▀▀`          
                   ²²²                 
███████████████████████████████████████

. ★☆ WWW.LEALANA.COM        My PGP fingerprint is A764D833.                  History of Monero development Visualization ★☆ .
LEALANA BITCOIN GRIM REAPER SILVER COINS.
 
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 26, 2013, 10:04:29 AM
 #7

My code is a mess now, so it isn`t ready for a public release, but because gaborjakab uses almost the same hardware, and OS like me, I can send a binary from the current stage just for the test. So if you like please send a private message.
LighningChun
Jr. Member
*
Offline Offline

Activity: 56
Merit: 10


View Profile
May 26, 2013, 11:16:16 AM
 #8

i have a gtx card, looking forward for this!  Grin
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 26, 2013, 11:51:06 AM
 #9

Well, I read the whole "NVIDIA Kepler (K20) from 134MHash/s to 330MHash/s with CUDA" topic. As far I know psychocoder has optimalized the cuda code for the GK110 architecture (sm_35 in cuda). There was only 2 card supporting this: Tesla K20, Geforce Titan, but from yesterday you can buy the Geforce 780! It is the same arch, a little cheaper and more consumer. If I had enough donations, I will buy one.

My solution is speeding up the older generations: Fermi and Kepler(sm_20, sm_30), that means geforce 4xx to 6xx. So I hope I can merge them together.

I sent a binary to gaborjakab, others please wait and keep me posted.
gaborjakab
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
May 26, 2013, 02:06:36 PM
 #10

Thanks man, I have tried it, now it goes with about 150 MHash/s, about 10% faster then before. It is not a miracle, but it is something.
thomassie12
Newbie
*
Offline Offline

Activity: 13
Merit: 0


View Profile
May 26, 2013, 02:38:41 PM
 #11

I used a gtx 660 (non Ti) i couldnt get it higher then 73 mhash/s
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 26, 2013, 05:51:40 PM
 #12

Thomassie12 are you still mining or you give up? What kind of miner do you use or used?
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
May 26, 2013, 10:46:59 PM
 #13

I'm currently using a EVGA GTX 670 with cgminer and I get 91-95 MH/s. Seems low, but I don't have any sort of optimizations for it just yet. So I'm very interested to see what you can achieve with this.

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 27, 2013, 05:14:37 AM
 #14

HI Kenshin23!

Can you give me some information: what OS you use is it 64bit win or linux(distro?)? And the most important are you using rpcminer or something else?
Cyberclones
Newbie
*
Offline Offline

Activity: 42
Merit: 0



View Profile
May 27, 2013, 05:47:22 AM
 #15

I started by running a EVGA GTX 550 Ti 1 GB was getting about 45Mhash/s without a overclock on it but I no longer run it
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 27, 2013, 02:15:48 PM
 #16

I`ve managed to merge the source. Tomorrow I will upload the source code to github. I am also thinking to create an cuda mining survival guide to add some help for nvidia card owners.
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
May 27, 2013, 03:03:28 PM
 #17

HI Kenshin23!

Can you give me some information: what OS you use is it 64bit win or linux(distro?)? And the most important are you using rpcminer or something else?
Hi, sure! Win 7 64 bit and currently using cgminer 3.1.1 as-is, but willing to switch to another miner if it means it'll be more efficient.

The CUDA mining survival guide sounds great; looking forward to it.

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
OnLashoc
Newbie
*
Offline Offline

Activity: 26
Merit: 0



View Profile
May 27, 2013, 03:50:55 PM
 #18

I'm in same boat you are ^^
xh4
Newbie
*
Offline Offline

Activity: 12
Merit: 0


View Profile
May 27, 2013, 03:52:59 PM
 #19

Looking forward to using all my old graphics cards for something useful...
gaborjakab
Newbie
*
Offline Offline

Activity: 3
Merit: 0


View Profile
May 28, 2013, 03:41:57 PM
 #20

Any news from here? I am still waiting the sourcecode.
Devnant
Member
**
Offline Offline

Activity: 118
Merit: 10


View Profile
May 28, 2013, 04:50:27 PM
 #21

Hey man! Very interested in testing your code on a GTX TITAN!

Game is my life
gourmet
Sr. Member
****
Offline Offline

Activity: 311
Merit: 250


View Profile
May 28, 2013, 05:01:57 PM
 #22

I`ve managed to merge the source. Tomorrow I will upload the source code to github. I am also thinking to create an cuda mining survival guide to add some help for nvidia card owners.

Looking forward to improve my GTX 580 performance at least a bit. It's not my main worker, but there's no reason to leave its power unused.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 28, 2013, 05:40:32 PM
 #23

Hey man! Very interested in testing your code on a GTX TITAN!

Sorry currently I am only optimalized for 4xx to 6xx, the titan is a different issue. maybe later
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 28, 2013, 05:42:30 PM
 #24

Any news from here? I am still waiting the sourcecode.

Sorry I am little busy at my job, I don`t have time for this today. I`ve created the github repo, but it is relatively empty now.
LuckyXIII
Newbie
*
Offline Offline

Activity: 17
Merit: 0



View Profile
May 28, 2013, 06:21:35 PM
 #25

Has anyone actually gotten an Nvidia card to mine efficiently enough to pay for it's own power consumption? Or are these solely for people mining with "free" electricity?
R3dcell
Member
**
Offline Offline

Activity: 78
Merit: 10



View Profile
May 28, 2013, 06:29:19 PM
 #26

very interesting, any info as to whether this would increase the hash rate on my GTX 660?

Graciously accepting donations from you kind Sirs Smiley 1GEAhoDdL8Vcu57PRJfRrpQNBdzBsLuFrG
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 29, 2013, 05:40:01 AM
 #27

Source has been released. You can found here:
https://github.com/icedev576/rpcminer-optcuda

Please follow the instructions, to build and run in: README, README.md, HOW-TO files

Anyone who was able to build any variation of rpcminer, can this also.

Compiled Binaries for different OS, may come later.
djmagicmetz
Newbie
*
Offline Offline

Activity: 5
Merit: 0


View Profile
May 29, 2013, 05:53:21 AM
 #28

Asics are going to fry these things. Game over.
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
May 29, 2013, 05:57:18 AM
 #29

Thanks for this! I'm downloading the CUDA toolkit to see if I can compile it for my system. Otherwise, I'll wait for the binaries.

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 29, 2013, 08:03:29 AM
 #30

Has anyone actually gotten an Nvidia card to mine efficiently enough to pay for it's own power consumption? Or are these solely for people mining with "free" electricity?

I think the short answer is no. I don`t think anyone bought an nvidia card just for mining. I think (just like me) we got nv card because other reasons (for me, I use it for work). In the computer spare time, mining can run, but it`s not worth unless you have "free" electricity.
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
May 29, 2013, 03:11:14 PM
 #31

Has anyone actually gotten an Nvidia card to mine efficiently enough to pay for it's own power consumption? Or are these solely for people mining with "free" electricity?

I think the short answer is no. I don`t think anyone bought an nvidia card just for mining. I think (just like me) we got nv card because other reasons (for me, I use it for work). In the computer spare time, mining can run, but it`s not worth unless you have "free" electricity.
In my particular case, I had an ATI HD 5770 but it sucked at flightsimming (which is what I built my computer for) so I sold that (stupid, I know, but I really didn't know much about bitcoins then) and bought the 670 I'm now using. It runs FSX great, but now I got interested in bitcoins and found out that the card I sold can hash at least twice as much than what the 670 can do normally. I can't really undo the sale, and buying an ATI card that can mine efficiently is more or less out of the question right now.

Hence, my interest in optimizing my card's performance. I also know that GPU mining is almost dead, but it's worth a shot, I guess.

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
gourmet
Sr. Member
****
Offline Offline

Activity: 311
Merit: 250


View Profile
May 29, 2013, 03:13:30 PM
Last edit: May 29, 2013, 03:33:15 PM by gourmet
 #32

Source has been released. You can found here:
https://github.com/icedev576/rpcminer-optcuda

Please follow the instructions, to build and run in: README, README.md, HOW-TO files

Anyone who was able to build any variation of rpcminer, can this also.

Compiled Binaries for different OS, may come later.

I've compiled cgminer on Ubuntu recently.
But there is Radeon card on that machine.
Never have been using rpc-miner. (Is there
any big difference in compilation/usage?)
My Nvidia is in a machine running Windows.
So I'll wait for Windows binaries, too.

[edit]
I'm running guiminer there now. I'm really
curious what the difference may be. :-)
dentldir
Sr. Member
****
Offline Offline

Activity: 333
Merit: 250



View Profile
May 30, 2013, 08:29:41 AM
 #33

Testing a 660ti with this right now, seeing about 104MH/s on stock clocks.

cgminer comes in at 100MH/s.  Both are steady, so it is a solid 4% improvement without any tweaking.

The same card, on scrypt using cudaminer, now gets around 160KH/s after several kernel iterations.  I don't know if there is anything instructive inside the spinlock kernels in cudaminer 4-30, but it might be worth a look.

Cheers and thanks for the good work.


1DentLdiRMv3dpmpmqWsQev8BUaty9vN3v
minehe
Newbie
*
Offline Offline

Activity: 41
Merit: 0


View Profile
May 30, 2013, 09:06:51 AM
 #34

260 MH/s is too slow.
dentldir
Sr. Member
****
Offline Offline

Activity: 333
Merit: 250



View Profile
May 30, 2013, 09:15:09 AM
 #35

On that 660ti, the auto config found:

Best configuration step found (-gpugrid=448,-gputhreads=1024) 104206 KHs  prev best=105422 KHs
Done allocating CUDA resources for (448,1024)

on a second run, and settled in at 105MH/s.

1DentLdiRMv3dpmpmqWsQev8BUaty9vN3v
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
May 30, 2013, 03:48:38 PM
 #36

On that 660ti, the auto config found:

Best configuration step found (-gpugrid=448,-gputhreads=1024) 104206 KHs  prev best=105422 KHs
Done allocating CUDA resources for (448,1024)

on a second run, and settled in at 105MH/s.

Are you using that card exclusively to mine, or do you have it configured for normal desktop usage?

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
dentldir
Sr. Member
****
Offline Offline

Activity: 333
Merit: 250



View Profile
May 30, 2013, 07:41:24 PM
 #37

It has an X server running on it, but I'm not using it as a desktop.  It was a default Ubuntu 12.04 install running the driver in the CUDA 5.0 package.  (304.86?).

1DentLdiRMv3dpmpmqWsQev8BUaty9vN3v
Cyberburner
Newbie
*
Offline Offline

Activity: 14
Merit: 0


View Profile
May 30, 2013, 09:15:19 PM
 #38

Problem with all these programs is you have to be a technician to be able to work it. The first one to release some more user friendly things, that actually work, will be my bitcoin jezus...
cyberkiller
Newbie
*
Offline Offline

Activity: 43
Merit: 0


View Profile WWW
May 30, 2013, 09:37:18 PM
 #39

interesting, definitely a good thing. Everyone is so focused on AMD cards, yes rightfully so as they are faster but people have tons of nvidia cards laying around.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 31, 2013, 07:06:42 AM
 #40

I have written instructions in the readme files, but I write here some important information:
 - Use 4.2 if you compile the cu files. CU files should be compiled as a second pass and compiled next to the executable
 - I've added precompiled cubin files to the project, it is OS independent so you can copy it next to the executable file, if you don't want to compile the cu files
 - The program has an auto-tuning feature, if you don't define the gpuGrid, and gputhreads, it will find the best configuration, it is a wider search then the original (it can take several minutes!)
 - Once it is done you can use it next time
 - It does't have native stratum support, use a proxy instead
 - For measuring the time please run it for at least a few minutes
nightyj
Member
**
Offline Offline

Activity: 89
Merit: 10


View Profile
May 31, 2013, 08:04:41 AM
 #41

With this miner I manage to get from my GTX480 from 110 till 140 Mh/s at default, thanks. 
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
May 31, 2013, 02:36:20 PM
 #42

With this miner I manage to get from my GTX480 from 110 till 140 Mh/s at default, thanks. 

You welcome!
Eli0t
Sr. Member
****
Offline Offline

Activity: 252
Merit: 250


View Profile
May 31, 2013, 03:13:24 PM
 #43

would be great to get some more MH out of my 560ti, currently 83 MH

waiting for win7 64bit binaries

have you fixed rpc-miner-cuda so it doesnt crash the display driver when closing it, or was that a windows issue

LTC:  LKpJf3uk7KsHU73kxq8iFJrP1AAKN7Yni7  DGC:  DKXGvEbj3Rwgrm2QQbRyNPDDZDYoq4Y44d  XPM:  AWV5AKfLFyoBaMjg9C77rGUBhuFxz5DGGL
zhuravno
Newbie
*
Offline Offline

Activity: 9
Merit: 0


View Profile
May 31, 2013, 03:27:41 PM
 #44

what performance increase is expected compared to guiminer or cgminer?
llieco
Newbie
*
Offline Offline

Activity: 12
Merit: 0


View Profile
May 31, 2013, 03:34:10 PM
 #45

260 MH/s to slow. Embarrassed
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 03, 2013, 05:18:48 AM
 #46

what performance increase is expected compared to guiminer or cgminer?

Well actually CG miner is quite good, I don't expect more then 10%. For Titan and GTX780, if somebody will do some optimalization CUDA can be better. If I have enough money I'll buy a 780, and try it.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 03, 2013, 05:20:30 AM
 #47

260 MH/s to slow. Embarrassed

Unfortunately NV cards not so good for mining. Buy AMD or some asic if you want more or you can design a new asic.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 03, 2013, 05:21:55 AM
 #48

would be great to get some more MH out of my 560ti, currently 83 MH

waiting for win7 64bit binaries

have you fixed rpc-miner-cuda so it doesnt crash the display driver when closing it, or was that a windows issue

Well, I don't know, I have dedicated cards for CUDA (no display or X attached to it)
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
June 07, 2013, 05:40:58 AM
 #49

Any update on binaries for Windows? I've tried compiling with Cygwin, but cmake complains it can't find CUDA_CUDART_LIBRARY.

Code:
$ cmake ../
-- The C compiler identification is GNU 4.5.3
-- The CXX compiler identification is GNU 4.5.3
CMake Warning at /usr/share/cmake-2.8.9/Modules/Platform/CYGWIN.cmake:15 (message):
  CMake no longer defines WIN32 on Cygwin!

  (1) If you are just trying to build this project, ignore this warning or
  quiet it by setting CMAKE_LEGACY_CYGWIN_WIN32=0 in your environment or in
  the CMake cache.  If later configuration or build errors occur then this
  project may have been written under the assumption that Cygwin is WIN32.
  In that case, set CMAKE_LEGACY_CYGWIN_WIN32=1 instead.

  (2) If you are developing this project, add the line

    set(CMAKE_LEGACY_CYGWIN_WIN32 0) # Remove when CMake >= 2.8.4 is required

  at the top of your top-level CMakeLists.txt file or set the minimum
  required version of CMake to 2.8.4 or higher.  Then teach your project to
  build on Cygwin without WIN32.
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/CMakeSystemSpecificInformation.cmake:36 (INCLUDE)
  CMakeLists.txt:3 (PROJECT)


-- Check for working C compiler: /usr/bin/gcc.exe
-- Check for working C compiler: /usr/bin/gcc.exe -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++.exe
-- Check for working CXX compiler: /usr/bin/c++.exe -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Boost version: 1.53.0
-- Found the following Boost libraries:
--   date_time
--   filesystem
--   program_options
--   regex
--   system
--   thread
-- Found OpenSSL: /usr/lib/libssl.dll.a;/usr/lib/libcrypto.dll.a (found version "1.0.1e")
-- Looking for include file pthread.h
-- Looking for include file pthread.h - found
-- Looking for pthread_create
-- Looking for pthread_create - found
-- Found Threads: TRUE
CMake Error at /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:97 (MESSAGE):
  Could NOT find CUDA (missing: CUDA_CUDART_LIBRARY)
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:288 (_FPHSA_FAILURE_MESSAGE)
  cmake-modules/FindCUDA.cmake:682 (find_package_handle_standard_args)
  CMakeLists.txt:32 (FIND_PACKAGE)


-- Configuring incomplete, errors occurred!

If anyone has any how-to's or some kind of guidance on how to get this working, please share.

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
mikal
Newbie
*
Offline Offline

Activity: 39
Merit: 0


View Profile
June 07, 2013, 07:20:20 PM
 #50

Can you explain the increase of hash rate? I thought the original miner got all could be achieved on sm30...
I'll start digging into the code, but I'd like to read high-level explanation.
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 12, 2013, 07:17:16 AM
 #51

Any update on binaries for Windows? I've tried compiling with Cygwin, but cmake complains it can't find CUDA_CUDART_LIBRARY.

Code:
$ cmake ../
-- The C compiler identification is GNU 4.5.3
-- The CXX compiler identification is GNU 4.5.3
CMake Warning at /usr/share/cmake-2.8.9/Modules/Platform/CYGWIN.cmake:15 (message):
  CMake no longer defines WIN32 on Cygwin!

  (1) If you are just trying to build this project, ignore this warning or
  quiet it by setting CMAKE_LEGACY_CYGWIN_WIN32=0 in your environment or in
  the CMake cache.  If later configuration or build errors occur then this
  project may have been written under the assumption that Cygwin is WIN32.
  In that case, set CMAKE_LEGACY_CYGWIN_WIN32=1 instead.

  (2) If you are developing this project, add the line

    set(CMAKE_LEGACY_CYGWIN_WIN32 0) # Remove when CMake >= 2.8.4 is required

  at the top of your top-level CMakeLists.txt file or set the minimum
  required version of CMake to 2.8.4 or higher.  Then teach your project to
  build on Cygwin without WIN32.
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/CMakeSystemSpecificInformation.cmake:36 (INCLUDE)
  CMakeLists.txt:3 (PROJECT)


-- Check for working C compiler: /usr/bin/gcc.exe
-- Check for working C compiler: /usr/bin/gcc.exe -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++.exe
-- Check for working CXX compiler: /usr/bin/c++.exe -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Boost version: 1.53.0
-- Found the following Boost libraries:
--   date_time
--   filesystem
--   program_options
--   regex
--   system
--   thread
-- Found OpenSSL: /usr/lib/libssl.dll.a;/usr/lib/libcrypto.dll.a (found version "1.0.1e")
-- Looking for include file pthread.h
-- Looking for include file pthread.h - found
-- Looking for pthread_create
-- Looking for pthread_create - found
-- Found Threads: TRUE
CMake Error at /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:97 (MESSAGE):
  Could NOT find CUDA (missing: CUDA_CUDART_LIBRARY)
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:288 (_FPHSA_FAILURE_MESSAGE)
  cmake-modules/FindCUDA.cmake:682 (find_package_handle_standard_args)
  CMakeLists.txt:32 (FIND_PACKAGE)


-- Configuring incomplete, errors occurred!

If anyone has any how-to's or some kind of guidance on how to get this working, please share.

Is your CUDA_TOOLKIT_ROOT_DIR environment variable set?
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 12, 2013, 07:26:35 AM
 #52

Can you explain the increase of hash rate? I thought the original miner got all could be achieved on sm30...
I'll start digging into the code, but I'd like to read high-level explanation.

Everything about the used register-count. If a CUDA code uses too many registers 2 things can happen: you can run the code on less threads, or you can tell the nvcc compiler to force use less register. There is no magic if your code needs 72 register, but you force the compiler use only 63 will mean 9 register will be in  local memory (available via spill loads), and this is slower then the registers.

So optimizing a CUDA code usually means write a code that can run with less register. On the other hand you can use a lot of special functions and types (I moved the input storing to __constant__ memory ). Memory reading and writing is slow, avoid unnecessary memory ops. And there are a lot of other tricks...
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 12, 2013, 09:50:12 AM
 #53

Code now works on cards with compute mode 3.5 (Titan, GTX780).
Tested on GTX-780 :-) It is about 275 MH/s.
kenshin23
Member
**
Offline Offline

Activity: 66
Merit: 10



View Profile
June 12, 2013, 05:08:34 PM
 #54

Any update on binaries for Windows? I've tried compiling with Cygwin, but cmake complains it can't find CUDA_CUDART_LIBRARY.

Code:
$ cmake ../
-- The C compiler identification is GNU 4.5.3
-- The CXX compiler identification is GNU 4.5.3
CMake Warning at /usr/share/cmake-2.8.9/Modules/Platform/CYGWIN.cmake:15 (message):
  CMake no longer defines WIN32 on Cygwin!

  (1) If you are just trying to build this project, ignore this warning or
  quiet it by setting CMAKE_LEGACY_CYGWIN_WIN32=0 in your environment or in
  the CMake cache.  If later configuration or build errors occur then this
  project may have been written under the assumption that Cygwin is WIN32.
  In that case, set CMAKE_LEGACY_CYGWIN_WIN32=1 instead.

  (2) If you are developing this project, add the line

    set(CMAKE_LEGACY_CYGWIN_WIN32 0) # Remove when CMake >= 2.8.4 is required

  at the top of your top-level CMakeLists.txt file or set the minimum
  required version of CMake to 2.8.4 or higher.  Then teach your project to
  build on Cygwin without WIN32.
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/CMakeSystemSpecificInformation.cmake:36 (INCLUDE)
  CMakeLists.txt:3 (PROJECT)


-- Check for working C compiler: /usr/bin/gcc.exe
-- Check for working C compiler: /usr/bin/gcc.exe -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++.exe
-- Check for working CXX compiler: /usr/bin/c++.exe -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Boost version: 1.53.0
-- Found the following Boost libraries:
--   date_time
--   filesystem
--   program_options
--   regex
--   system
--   thread
-- Found OpenSSL: /usr/lib/libssl.dll.a;/usr/lib/libcrypto.dll.a (found version "1.0.1e")
-- Looking for include file pthread.h
-- Looking for include file pthread.h - found
-- Looking for pthread_create
-- Looking for pthread_create - found
-- Found Threads: TRUE
CMake Error at /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:97 (MESSAGE):
  Could NOT find CUDA (missing: CUDA_CUDART_LIBRARY)
Call Stack (most recent call first):
  /usr/share/cmake-2.8.9/Modules/FindPackageHandleStandardArgs.cmake:288 (_FPHSA_FAILURE_MESSAGE)
  cmake-modules/FindCUDA.cmake:682 (find_package_handle_standard_args)
  CMakeLists.txt:32 (FIND_PACKAGE)


-- Configuring incomplete, errors occurred!

If anyone has any how-to's or some kind of guidance on how to get this working, please share.

Is your CUDA_TOOLKIT_ROOT_DIR environment variable set?
It wasn't, but I just set it prior to compilation and the problem persists. Any other clues? Or is cygwin not appropriate for compiling this? (might have to make a liveUSB with linux just for mining...)

Donations are very welcome! 1LJ7k5b71cYFrBVQ7zYSBJePZ7EuFBFr5a
Pyramining referral links: http://pyramining.com/referral/be4kfg2m7 | http://pyramining.com/referral/yghmk7r9f
Bitrated user: kenshin23.
Eli0t
Sr. Member
****
Offline Offline

Activity: 252
Merit: 250


View Profile
June 12, 2013, 05:25:33 PM
 #55

Code now works on cards with compute mode 3.5 (Titan, GTX780).
Tested on GTX-780 :-) It is about 275 MH/s.
any hope for compute 2.1? also how does that rate vs your cgminer speed

LTC:  LKpJf3uk7KsHU73kxq8iFJrP1AAKN7Yni7  DGC:  DKXGvEbj3Rwgrm2QQbRyNPDDZDYoq4Y44d  XPM:  AWV5AKfLFyoBaMjg9C77rGUBhuFxz5DGGL
icedev576 (OP)
Newbie
*
Offline Offline

Activity: 21
Merit: 0


View Profile
June 24, 2013, 12:09:41 PM
 #56

Code now works on cards with compute mode 3.5 (Titan, GTX780).
Tested on GTX-780 :-) It is about 275 MH/s.
any hope for compute 2.1? also how does that rate vs your cgminer speed

UPDATE:
Automatic GPU architecture chooser was added it works on any architecture from 1.0 to 3.5. The relevant cubin file should be in cmake-rpcminer. I added these files on github, so use them.
Eli0t
Sr. Member
****
Offline Offline

Activity: 252
Merit: 250


View Profile
June 24, 2013, 07:53:57 PM
 #57

Code now works on cards with compute mode 3.5 (Titan, GTX780).
Tested on GTX-780 :-) It is about 275 MH/s.
any hope for compute 2.1? also how does that rate vs your cgminer speed

UPDATE:
Automatic GPU architecture chooser was added it works on any architecture from 1.0 to 3.5. The relevant cubin file should be in cmake-rpcminer. I added these files on github, so use them.
i guess the difficulty is too high now, but if you could include all the relevant information from the whole thread in the first post and possibly some windows binaries im sure it would help someone out.

think i missed some words from my last post. what is your rpcminer-optcuda MH vs your cgminer MH ?

LTC:  LKpJf3uk7KsHU73kxq8iFJrP1AAKN7Yni7  DGC:  DKXGvEbj3Rwgrm2QQbRyNPDDZDYoq4Y44d  XPM:  AWV5AKfLFyoBaMjg9C77rGUBhuFxz5DGGL
gourmet
Sr. Member
****
Offline Offline

Activity: 311
Merit: 250


View Profile
June 24, 2013, 10:12:57 PM
 #58

Also looking forward to Windows binaries and the comparison with cgminer. ;-)
Pages: 1 2 3 [All]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!