Bitcoin Forum
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347503 times)
scryptr
July 08, 2015, 12:31:23 PM
 #3941

DJM34, SP_ --

I flipped you each a nickel.  Thank you for your hard work!  I hope my 960s will be able to mine Lyra2 on Windows at about 1250kh/s soon, maybe more!

Thanks!       --scryptr

P.S.  I was able to get DJM34's Windows binary to run on my Win 7 x64 system with a 2GB GTX960 SSC with ONLY the performance setting of "-i 16.3".  No other performance settings were used.  Algo, username, password were as standard.

Result:  1175kh/s mining Lyra2       --scryptr
thanks,
Don't forget to run in the P0 state using nvidia-smi; that makes it possible to OC the memclock (it will run also at a somewhat)

From what I've seen memory OCs aren't worth it. They give you a tiny bit of extra hash and they wreck your efficiency. Like 5% more hashrate for 15% more power.
Well, they are worth it for memory-hard algos, as it decreases the frame buffer usage... (meaning less of a bottleneck at that level)
And if you have a large number of cards, you probably want moderate power usage; if you are limited in GPU resources, you want to get the highest hashrate.
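
To put a rough number on the trade-off argued above, here is a minimal sketch in Python. It simply takes the "5% more hashrate for 15% more power" figures from the post at face value; the function name is made up for illustration, and real cards will differ.

```python
def relative_efficiency(hash_gain, power_gain):
    """Hash-per-watt after an overclock, relative to stock (1.0 = unchanged).

    Both arguments are fractional increases, e.g. 0.05 for +5%.
    """
    return (1.0 + hash_gain) / (1.0 + power_gain)


# Figures quoted in the post above: +5% hashrate for +15% power.
eff = relative_efficiency(0.05, 0.15)
print(f"hash per watt vs. stock: {eff:.3f}")  # prints 0.913
```

So on those figures the card does about 9% less work per watt, which matters on a big farm and much less on a rig that is short on GPUs.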

OVERCLOCKING--

I am running DJM34's Windows binary with an intensity setting of "-i 16.3" and +100 core / +300 mem overclock utilizing EVGA PrecisionX 16.  The result is 1200kh/s on my single 2GB GTX 960 SSC.  I earlier reported 1175kh/s with a lower overclock of +80/+240.  The overclock of +100/+300 was the highest stable overclock when mining Quark, but recent Quark code changes made it less stable.

If the 960 remains stable for a day or so, I may increase the overclock again.  For some reason, I had difficulty launching DJM34's Windows binary at default intensity, or my former setting of "-i 16.5" for Lyra2.  The card is mining Lyra2 instead of Quark, and making a big difference in my total Lyra2 hash power.

The card should be mining in the "P0" state, but PrecisionX 16 doesn't have a specific indicator for that.

I also play games with it.  :)       --scryptr
Default intensity is set per compute version... Since my compute_52 card is a 980 with 4GB of memory, it works well... Obviously, with a 960 with only 2GB it might not work; it should use the same setting as the 750ti.
The P0 state is shown in NVIDIA Inspector. (However, if you didn't change it, it is most likely running at P2, an issue for all the 900-series cards, and the mem OC is probably not applied at all.)

I think there is an option in latest sp version (on which my release is based actually) to set p0 state (haven't tried though...), I used the command line
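
For anyone who prefers checking the P state from a script instead of a GUI, a minimal sketch in Python follows. It assumes nvidia-smi is on the PATH and that the driver supports the `pstate` field of `--query-gpu`; the helper names are made up for illustration. It only reads the state; it does not change it.

```python
import subprocess

# Prints one "index, pstate" line per GPU, e.g. "0, P2".
QUERY = ["nvidia-smi", "--query-gpu=index,pstate", "--format=csv,noheader"]


def parse_pstates(csv_text):
    """Map GPU index -> performance state from 'index, pstate' CSV lines."""
    states = {}
    for line in csv_text.strip().splitlines():
        index, pstate = (field.strip() for field in line.split(","))
        states[int(index)] = pstate
    return states


def cards_not_in_p0(states):
    """Return the sorted indices of GPUs that are not in P0."""
    return sorted(i for i, s in states.items() if s != "P0")


if __name__ == "__main__":
    try:
        out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    except (OSError, subprocess.CalledProcessError) as exc:
        print(f"could not query nvidia-smi: {exc}")
    else:
        print("GPUs not in P0:", cards_not_in_p0(parse_pstates(out)))
```

Run it while the miner is hashing; an idle card will normally report a lower-power state anyway.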

nVidia Inspector--

OK, I switched to using nVidia Inspector (by Orbmu2k).  It detects the "P" state, and I have selected P0.  I have been playing with the clock settings, and the best hash rate I have reached is 1230kh/s.  I think that "P2" is the default P state for the card.  For the moment, I am running at +150/+300, and the hash rate varies between 1200kh/s and 1230kh/s.

This data is pretty raw; milder clocks most likely would give a stable 1200kh/s.       --scryptr

TIPS:  BTC - 1Fs4uZ6a9ABYBTaHGUfqcwCQmeBRxkKRQT    DASH - XrK81tW31SLsVvZ2WX9VhTjpT6GXJPLdbQ
          SCRYPTR'S NOTEBOOK: https://bitcointalk.org/index.php?topic=5035515.msg46035530#msg46035530
          GITHUB: "github.com/scryptr"  MERIT is appreciated, also.  Thanks!
Epsylon3
ccminer/cpuminer developer
July 08, 2015, 02:03:18 PM
 #3942

yiimp is a "test pool" I am trying to set up... without auto exchange. I will update the main page to explain it better soon...

It will not be like the yaamp multipool system, which requires a lot of attention to trades.

Otherwise... CUDA 7.5 really improves ccminer, on almost all algos :p

I don't know how you can say CUDA7.5 does so much better. It was just put out to developers, hence the 'RC' designation. Stands for 'Release Candidate', meaning it's in the early stages.

Concerning your 'test pool'; I wouldn't broadcast you are trying this until you are ready to pay-up! I almost started mining there thinking I would be paid for the work I was doing. Just sayin'!

It's written on the main page: Yiimp is not an "autotrade" platform... So, as on other pools, you mine the currency you want with the -right- currency address. I don't want to pay the whole of China, which is using SHA farms, in VTC (or BTC).

The pool is working and pays what is mined... I don't want a second full-time job running an exchange ;) Consider the fees a donation for the new algos... Some are set very high because we are doing "private" tests... you can still mine on those, but it's done to reduce "anonymous" users...

BTC: 1FhDPLPpw18X4srecguG3MxJYe4a1JsZnd - My Projects: ccminer - cpuminer-multi - yiimp - Forum threads : ccminer - cpuminer-multi - yiimp
rednoW
July 08, 2015, 02:55:19 PM
 #3943

Wow, the new lyra2re code is here, thanks to djm and sp! My "hero" GTX 750 was taken off the shelf for this occasion. With a 1500/1500 GPU/mem clock it shows 1115 khash/s with only 40% memory controller load. The old code was near 100% mem load.
And it is still not a very hot algo: 25-30% less power than quark.
scryptr
July 08, 2015, 03:06:57 PM
 #3944

GTX 960 SSC 2GB and Lyra2--

It just works!  With a setting of "-i 16.3", and +150/+300 core/mem, I get this:



[Screenshot: GTX 960 mining Lyra2 with DJM34 Windows binary.]

Actually, the hash rate fluctuates between 1200kh/s and the 1236kh/s shown, depending on system load.  The 960 is the only card in the Win 7 x64 system.       --scryptr

GingerAle
July 08, 2015, 03:14:51 PM
 #3945

has anyone got cryptonight hashrates for the 960? Was thinking of buying one today - tigerdirect has em for 190$ or something.

< Track your bitcoins! > < Track them again! > <<< [url=https://www.reddit.com/r/Bitcoin/comments/1qomqt/what_a_landmark_legal_case_from_mid1700s_scotland/] What is fungibility? >>> 46P88uZ4edEgsk7iKQUGu2FUDYcdHm2HtLFiGLp1inG4e4f9PTb4mbHWYWFZGYUeQidJ8hFym2WUmWc p34X8HHmFS2LXJkf <<< Free subdomains at moneroworld.com!! >>> <<< If you don't want to run your own node, point your wallet to node.moneroworld.com, and get connected to a random node! @@@@ FUCK ALL THE PROFITEERS! PROOF OF WORK OR ITS A SCAM !!! @@@@
misterycoins
July 08, 2015, 03:28:46 PM
 #3946

something is strange here. CCminer must be still doing something but I can't find it in task manager.
flipclip
July 08, 2015, 03:31:21 PM
 #3947


COMMIT/BUILD #843--

Each commit to GitHub increments the commit number (Upper Left Hand corner)  There is also a commit hash number, but it is not sequential, so I don't use it.  I just checked and your GitHub commit number is 843.  That is the commit that I built and am currently mining with; it is maybe 12 hours old now.       --scryptr

How do you do checkouts then?  I'm used to the command line and using the sha for checkouts, so that is why I am wondering.
scryptr
July 08, 2015, 03:38:27 PM
 #3948

has anyone got cryptonight hashrates for the 960? Was thinking of buying one today - tigerdirect has em for 190$ or something.

CRYPTONIGHT--

A few pages back, page 202, SP_ has a screen shot of a GTX 960 mining Cryptonight at 280H/s next to a 750ti mining at 300H/s.  Properly coded, the 960 should outperform the 750ti.

The Lyra2 code may have something in common, and yield a clue for optimization.       --scryptr

GingerAle
July 08, 2015, 03:44:54 PM
 #3949

has anyone got cryptonight hashrates for the 960? Was thinking of buying one today - tigerdirect has em for 190$ or something.

CRYPTONIGHT--

A few pages back, page 202, SP_ has a screen shot of a GTX 960 mining Cryptonight at 280H/s next to a 750ti mining at 300H/s.  Properly coded, the 960 should outperform the 750ti.

The Lyra2 code may have something in common, and yield a clue for optimization.       --scryptr

Thanks for that, I apologize for my inability to scan through these pages. "Properly coded"... how do we make that happen? I was going to try and go through tsiv's ccminer and just replace all of the fixed values with variables that could be set from the command line and modified via an internal optimizer routine... but then I remembered I can't program, and just stared at the code and got really frustrated 'cause I'm like "I KNOW IT'S IN THERE!"

bah.

scryptr
July 08, 2015, 03:49:30 PM
 #3950


COMMIT/BUILD #843--

Each commit to GitHub increments the commit number (Upper Left Hand corner)  There is also a commit hash number, but it is not sequential, so I don't use it.  I just checked and your GitHub commit number is 843.  That is the commit that I built and am currently mining with; it is maybe 12 hours old now.       --scryptr

How do you do checkouts then?  I'm used to the command line and using the sha for checkouts, so that is why I am wondering.

COMMAND LINE--

I use the command line, and refer to the commit number when posting about performance.  The sha will verify checksum, and is very precise for that purpose.  Commit numbers are sequential.

The line, "git clone https://github.com/sp-hash/ccminer", should clone the latest commit.  If I am wrong, please tell me!
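
For the checkout question, git itself only checks out by SHA (or branch/tag), so a commit number has to be translated first. Here is a minimal sketch in Python, assuming the number shown on GitHub matches the position of the commit in `git rev-list --reverse HEAD` on the same branch; with merges in the history that may not hold exactly, so treat the result as a starting point. The helper names are made up for illustration.

```python
import subprocess


def sha_for_commit_number(shas_oldest_first, number):
    """Return the SHA at 1-based position `number` in an oldest-first list."""
    if not 1 <= number <= len(shas_oldest_first):
        raise ValueError(
            f"commit number {number} is outside 1..{len(shas_oldest_first)}"
        )
    return shas_oldest_first[number - 1]


def list_shas(repo_dir):
    """List the commit SHAs reachable from HEAD in repo_dir, oldest first."""
    out = subprocess.run(
        ["git", "-C", repo_dir, "rev-list", "--reverse", "HEAD"],
        capture_output=True, text=True, check=True,
    ).stdout
    return out.split()


# Example (needs a local clone in ./ccminer):
#   sha = sha_for_commit_number(list_shas("ccminer"), 843)
#   then run: git -C ccminer checkout <sha>
```

A plain "git clone" does give you the latest commit on the default branch; the lookup above is only needed to get back to an older build.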

--scryptr

sp_ (OP)
Team Black developer
July 08, 2015, 04:13:58 PM
 #3951

lyra2:

Submitted a 90KHASH improvement on the gtx970 (6.3%)
and 20KHASH improvement on the 750ti.

Again by reducing the code size to reduce instruction-cache fetching

Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW MEOWPOW + dual mining + tripple mining.. https://github.com/sp-hash/TeamBlackMiner
sp_ (OP)
July 08, 2015, 05:08:00 PM
 #3952

has anyone got cryptonight hashrates for the 960? Was thinking of buying one today - tigerdirect has em for 190$ or something.
CRYPTONIGHT--
A few pages back, page 202, SP_ has a screen shot of a GTX 960 mining Cryptonight at 280H/s next to a 750ti mining at 300H/s.  Properly coded, the 960 should outperform the 750ti.
The Lyra2 code may have something in common, and yield a clue for optimization.       --scryptr

This is right. I think that by using the djm34 techniques the cryptonight miner will improve a lot, by using vectors with the shuffle instruction to avoid global memory. A great task for DJM34. He likes this stuff, and does it well. A 1000+ hour job.

I like to mod the kernels. Spend a few hours and gain a few percent. ;)

I don't use NSIGHT and I am not a registered CUDA developer.

sp_ (OP)
July 08, 2015, 06:06:09 PM
 #3953

I have built a new version:

-Merged and modded the new DJM34 lyra implementation. Lyra 66% faster on gtx970 and 46% faster on the 750ti
-small improvement in quark on the 750ti on standard clocks.

1.5.54(sp-MOD) is available here: (08-07-2015)

https://github.com/sp-hash/ccminer/releases/tag/1.5.54

The source code is available here:

https://github.com/sp-hash/ccminer

jjjordan
July 08, 2015, 08:43:40 PM
 #3954

7x970 - Lyra2 won't start (out of memory)
4GB system memory
djm34
July 08, 2015, 10:56:16 PM
 #3955

7x970 - Lyra2 won't start (out of memory)
4GB system memory
7x 970?! Mobos for that are so expensive...
You need at least as much RAM as VRAM; here, something like 28 GB to run that kind of system.
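
The 28 GB figure is just cards times VRAM per card. As a minimal sketch in Python, treating "as much RAM as VRAM" as the rule of thumb from this post and not as a documented driver requirement (the function name is made up for illustration):

```python
def min_system_ram_gb(num_cards, vram_gb_per_card):
    """Rule-of-thumb host memory: at least the total VRAM across all cards."""
    return num_cards * vram_gb_per_card


needed = min_system_ram_gb(7, 4)   # 7x GTX 970 with 4 GB each
installed = 4                      # system memory reported in the post above
print(needed, "GB suggested,", needed - installed, "GB short")  # 28 GB suggested, 24 GB short
```

Pagefile or swap may count toward that total, so the shortfall is an upper bound on how much physical RAM is missing.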


djm34 facebook page
BTC: 1NENYmxwZGHsKFmyjTc5WferTn5VTFb7Ze
Pledge for neoscrypt ccminer to that address: 16UoC4DmTz2pvhFvcfTQrzkPTrXkWijzXw
bathrobehero
July 08, 2015, 11:22:50 PM
 #3956

7x970 - Lyra2 won't start (out of memory)
4GB system memory
7x 970 ?!  Roll Eyes mobo are so expensive...
you need at least as much ram than vram here something like 28Gb to run that kind of system



Surely there has to be a workaround. I mean the memory/swap doesn't even seem to be allocated let alone used, not even for a second.
Something like initializing the cards one after the other instead of all at the same time or something? Or giving the cards different jobs instead of working together on one big job? I have no idea but I'm sure there's a way.

Not your keys, not your coins!
joblo
July 08, 2015, 11:59:33 PM
 #3957

7x970 - Lyra2 won't start (out of memory)
4GB system memory
7x 970 ?!  Roll Eyes mobo are so expensive...
you need at least as much ram than vram here something like 28Gb to run that kind of system



Surely there has to be a workaround. I mean the memory/swap doesn't even seem to be allocated let alone used, not even for a second.
Something like initializing the cards one after the other instead of all at the same time or something? Or giving the cards different jobs instead of working together on one big job? I have no idea but I'm sure there's a way.

How many cards can you run? It's cheaper to add more RAM than to split the cards into multiple rigs.

AKA JayDDee, cpuminer-opt developer. https://github.com/JayDDee/cpuminer-opt
https://bitcointalk.org/index.php?topic=5226770.msg53865575#msg53865575
BTC: 12tdvfF7KmAsihBXQXynT6E6th2c2pByTT,
djm34
July 09, 2015, 01:55:20 AM
 #3958

7x970 - Lyra2 won't start (out of memory)
4GB system memory
7x 970 ?!  Roll Eyes mobo are so expensive...
you need at least as much ram than vram here something like 28Gb to run that kind of system



Surely there has to be a workaround. I mean the memory/swap doesn't even seem to be allocated let alone used, not even for a second.
Something like initializing the cards one after the other instead of all at the same time or something? Or giving the cards different jobs instead of working together on one big job? I have no idea but I'm sure there's a way.
If you open MSI AB and watch both the RAM and pagefile graphs, you'll see it gets allocated (more on the pagefile than in memory), so maybe increasing the pagefile could work.
There isn't really a workaround on the code side: global memory variables have to be allocated from the host, and cudaMalloc works in mysterious ways...

scryptr
July 09, 2015, 01:56:59 AM
 #3959

SP_ RELEASE dot 54--

SP_'s reduction of register use in Lyra2 appears to have reduced some of the memory requirement.    I have been able to increase my intensity setting from my old standard of "-i 16.5" to higher values and see hash rate improvement, but the setting varies per machine.  My initial results, all for Lyra2:

  GTX 750ti FTW - 1080-1100kh/s per card (Linux)
  GTX 750ti SC - 1140-1150kh/s per card (Win 8)
  GTX 960 2GB SSC - 1220-1240kh/s per card (Win 7)
  GTX 960 4GB FTW - 1220-1240kh/s per card (Win 8)
  GTX 970 4GB FTW+ - 2Mh/s per card (Linux)

The Windows machines allow for easy software overclocking.  I still need to learn the command line API flags for Linux, and probably need to re-install Linux with the latest drivers for proper use.

If I move to CUDA Toolkit 7.5, will the SP_ releases still compile on Linux?       --scryptr

mendoza1468
July 09, 2015, 03:35:12 AM
Last edit: July 09, 2015, 04:01:10 AM by mendoza1468
 #3960

Release 54

2 GTX 970 / Nicehash (QUARK is still the best payout in BTC)
EVGA 04G-2974-KR GeForce GTX 970 Superclocked 4GB

QUARK (0.014 BTC / day atm)
ccminer.exe -i 22.9 -r 5 -R 10 --cpu-priority 5 -q -a quark -o stratum+tcp://quark.usa.nicehash.com:3345 -u xxxxxxxxxxx -p x
31 350 khash/s

LYRA2 (0.003 BTC / day atm)
ccminer.exe -i 18 -r 5 -R 10 --cpu-priority 5 -q -a lyra2 -o stratum+tcp://quark.usa.nicehash.com:3342 -u xxxxxxxxxxx -p x
3950 khash/s VS 2383 khash/s Release53

QUBIT (0.006 BTC / day atm)
ccminer.exe -i 21 -r 5 -R 10 --cpu-priority 5 -a Qubit -o stratum+tcp://qubit.usa.nicehash.com:3344 -u xxxxxxxxxxx -p x
25 500 khash/s

X11 (0.0087 BTC / day atm)
ccminer.exe -i 21 -r 5 -R 10 --cpu-priority 5 -o stratum+tcp://quark.usa.nicehash.com:3336 -u xxxxxxxxxxx -p x
16 450 khash/s

X13 (0.006135 BTC / day atm)
ccminer.exe -i 19 -r 5 -R 10 --cpu-priority 5 -o stratum+tcp://x13.usa.nicehash.com:3337 -u xxxxxxxxxxx -p x
15 600 khash/s

X15 (0.0071 BTC / day atm)
ccminer.exe -i 21 -r 5 -R 10 --cpu-priority 5 -o stratum+tcp://x15.usa.nicehash.com:3339 -u xxxxxxxxxxx -p x
15 200 khash/s

KECCAK (0.0024 BTC / day atm)
ccminer.exe -i 22.9 -r 5 -R 10 --cpu-priority 5 -q -a Keccak -o stratum+tcp://keccak.usa.nicehash.com:3338 -u xxxxxxxxxxx -p x
878 800 khash/s
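
As a quick sanity check on the Lyra2 numbers above, a minimal sketch in Python that turns the Release 53 and Release 54 figures from this post into a percentage. The function name is made up for illustration; the inputs are the two-card totals as posted.

```python
def speedup_percent(old_rate, new_rate):
    """Percentage increase going from old_rate to new_rate (same units)."""
    return (new_rate / old_rate - 1.0) * 100.0


# Lyra2 on 2x GTX 970: 2383 khash/s (Release 53) vs 3950 khash/s (Release 54).
gain = speedup_percent(2383, 3950)
print(f"{gain:.1f}% faster")            # prints 65.8% faster
print(3950 / 2, "khash/s per card")     # prints 1975.0 khash/s per card
```

That lines up with the "66% faster on gtx970" in the release notes and with the roughly 2Mh/s per 970 reported a few posts up.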


Thanks SP, djm34 and all others who help and contribute!!