sp_ (OP)
Legendary
Offline
Activity: 2898
Merit: 1087
Team Black developer
|
|
September 05, 2015, 12:04:43 PM |
|
I reverted the jh512 change and the rotr change.
That is suspect: quark jh and x11 bmw were redefining rotr32 with the original sph code, while all the rest use the cuda_helper version which has specific cuda code. If the cuda specific code is slower, than we should replace the rotr32 definition in cuda_helper because all the other files will benefit from it. But I highly doubt that it's the case: cuda_helper has been worked on for months and it can't be slower than sph code, unless the compiler is so smart that it can detect it's a rotate and replace it with even more optimized code. In brief, I wouldn't reput the double definitions (it's a warning...) but adjust the cuda_helper version, so we have the best rotr32 on all algos. The compiler is smart. it converts the ROTL32c(x, n) ((x) << (n)) | ((x) >> (32 - (n))) to one assembly instrucion. the funnelshift. Take a look at the generated .ptx code in the release folder. But in the lyra2v2 algo I now get +50KHASH on the gtx 970, so sometimes it's faster to use the method in cuda helper.
|
|
|
|
|
|
|
"Bitcoin: mining our own business since 2009" -- Pieter Wuille
|
|
|
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
|
sp_ (OP)
Legendary
Offline
Activity: 2898
Merit: 1087
Team Black developer
|
|
September 05, 2015, 12:10:42 PM |
|
I think it's because the compiler is unable to precalculate and remove instructions that operate on contant data when we use inline asm. rotl(const,x)
|
|
|
|
pallas
Legendary
Offline
Activity: 2716
Merit: 1094
Black Belt Developer
|
|
September 05, 2015, 12:16:12 PM |
|
with the current commit (reverting quark_jh and quark_skein but not groestl) quark is a bit faster... that's puzzling :-)
"in the lyra2v2 algo I now get +50KHASH on the gtx 970" but lyra2v2 is not using quark/, Algo256/ or groestl... still puzzled ;-)
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2898
Merit: 1087
Team Black developer
|
|
September 05, 2015, 12:21:21 PM |
|
with the current commit (reverting quark_jh and quark_skein but not groestl) quark is a bit faster... that's puzzling :-) "in the lyra2v2 algo I now get +50KHASH on the gtx 970" but lyra2v2 is not using quark/, Algo256/ or groestl... still puzzled ;-)
yes, but you also changed the rotl32 in the blake256 And it seems to be a little bether there. So I kept it.
|
|
|
|
pallas
Legendary
Offline
Activity: 2716
Merit: 1094
Black Belt Developer
|
|
September 05, 2015, 12:42:25 PM |
|
with the current commit (reverting quark_jh and quark_skein but not groestl) quark is a bit faster... that's puzzling :-) "in the lyra2v2 algo I now get +50KHASH on the gtx 970" but lyra2v2 is not using quark/, Algo256/ or groestl... still puzzled ;-)
yes, but you also changed the rotl32 in the blake256 And it seems to be a little bether there. So I kept it. I see. But that means that x11 (and others based on the same file) should also be faster :-)
|
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2898
Merit: 1087
Team Black developer
|
|
September 05, 2015, 12:50:49 PM |
|
with the current commit (reverting quark_jh and quark_skein but not groestl) quark is a bit faster... that's puzzling :-) "in the lyra2v2 algo I now get +50KHASH on the gtx 970" but lyra2v2 is not using quark/, Algo256/ or groestl... still puzzled ;-)
yes, but you also changed the rotl32 in the blake256 And it seems to be a little bether there. So I kept it. I see. But that means that x11 (and others based on the same file) should also be faster :-) x11 uses blake 512. the 256 algos are neoscrypt,lyra, and lyra2
|
|
|
|
sp_ (OP)
Legendary
Offline
Activity: 2898
Merit: 1087
Team Black developer
|
|
September 05, 2015, 12:51:30 PM |
|
Submitted a speedup 50KHASH in x11 on the 980ti. With code from the klaus_t fork. and constant rotates.
|
|
|
|
t-nelson
Member
Offline
Activity: 70
Merit: 10
|
|
September 05, 2015, 05:19:55 PM |
|
-- SNIP --
needs to include cuda_vector.h instead of cuda_helper.h I didn't modify the includes. Furthermore, it compiles fine on Linux... I know the build files on windows have been enhanced recently, maybe it's related to that. I can't help because I never built on win. The header deps are a mess. Odds are it was pulled in by some other header on Linux and you got (un?)lucky.
|
BTC: 1K4yxRwZB8DpFfCgeJnFinSqeU23dQFEMu DASH: XcRSCstQpLn8rgEyS6yH4Kcma4PfcGSJxe
|
|
|
t-nelson
Member
Offline
Activity: 70
Merit: 10
|
|
September 05, 2015, 06:08:06 PM |
|
Submitted a speedup 50KHASH in x11 on the 980ti. With code from the klaus_t fork. and constant rotates.
Seems on par with 65 on 750ti and 960. At least no regression noticeable.
|
BTC: 1K4yxRwZB8DpFfCgeJnFinSqeU23dQFEMu DASH: XcRSCstQpLn8rgEyS6yH4Kcma4PfcGSJxe
|
|
|
scryptr
Legendary
Offline
Activity: 1793
Merit: 1028
|
|
September 05, 2015, 10:51:39 PM |
|
GIMP ON YIIMP-- For the last couple hours I mined MUE coin (Monetary Unit) on YiiMP (yiimp.ccminer.org). MUE is a quark algorithm coin, and according to CoinWarz, currently capable of producing ~200-400 dollars a day in mining revenue. There must be a decimal place or two missing in the calculation! I mined over 1000 MUE in the space of time that I spent on YiiMP, but they are currenly worth about 15 satoshis each. I will be earning about .00015 BTC on the exchange when my MUE coins mature, I think. Not so sure! Every block that I hit, I got all the coins save the pool share. Somehow, CoinWarz led me to believe that I'd be earning 100X more, at least. I solved ~40 blocks at 40 MUE coins per block. Because YiiMP is more or less a hobby/experimental pool, it was as if I was solomining. There was no other miner to share the blocks with me, just the 1% pool share to deduct. If I could only find the right coin... --scryptr
|
|
|
|
bathrobehero
Legendary
Offline
Activity: 2002
Merit: 1051
ICO? Not even once.
|
|
September 05, 2015, 11:06:44 PM |
|
GIMP ON YIIMP-- For the last couple hours I mined MUE coin (Monetary Unit) on YiiMP (yiimp.ccminer.org). MUE is a quark algorithm coin, and according to CoinWarz, currently capable of producing ~200-400 dollars a day in mining revenue. There must be a decimal place or two missing in the calculation! I mined over 1000 MUE in the space of time that I spent on YiiMP, but they are currenly worth about 15 satoshis each. I will be earning about .00015 BTC on the exchange when my MUE coins mature, I think. Not so sure! Every block that I hit, I got all the coins save the pool share. Somehow, CoinWarz led me to believe that I'd be earning 100X more, at least. I solved ~40 blocks at 40 MUE coins per block. Because YiiMP is more or less a hobby/experimental pool, it was as if I was solomining. There was no other miner to share the blocks with me, just the 1% pool share to deduct. If I could only find the right coin... --scryptr MonetaryUnit (MUE) Exchange volume (24h): 0.02 BTC. No offense, but I still chuckle everytime when I see a multipool picking up coins with 0 volume. If I ever launch a coin it will have coin maturity of about a week and it will change the algo every few weeks just to throw multipools off.
|
Not your keys, not your coins!
|
|
|
chrysophylax
Legendary
Offline
Activity: 2814
Merit: 1091
--- ChainWorks Industries ---
|
|
September 06, 2015, 02:08:45 AM |
|
GIMP ON YIIMP-- For the last couple hours I mined MUE coin (Monetary Unit) on YiiMP (yiimp.ccminer.org). MUE is a quark algorithm coin, and according to CoinWarz, currently capable of producing ~200-400 dollars a day in mining revenue. There must be a decimal place or two missing in the calculation! I mined over 1000 MUE in the space of time that I spent on YiiMP, but they are currenly worth about 15 satoshis each. I will be earning about .00015 BTC on the exchange when my MUE coins mature, I think. Not so sure! Every block that I hit, I got all the coins save the pool share. Somehow, CoinWarz led me to believe that I'd be earning 100X more, at least. I solved ~40 blocks at 40 MUE coins per block. Because YiiMP is more or less a hobby/experimental pool, it was as if I was solomining. There was no other miner to share the blocks with me, just the 1% pool share to deduct. If I could only find the right coin... --scryptr MonetaryUnit (MUE) Exchange volume (24h): 0.02 BTC. No offense, but I still chuckle everytime when I see a multipool picking up coins with 0 volume. If I ever launch a coin it will have coin maturity of about a week and it will change the algo every few weeks just to throw multipools off. id be in on that ... #crysx
|
|
|
|
bensam1231
Legendary
Offline
Activity: 1750
Merit: 1024
|
|
September 06, 2015, 06:23:03 AM |
|
GIMP ON YIIMP-- For the last couple hours I mined MUE coin (Monetary Unit) on YiiMP (yiimp.ccminer.org). MUE is a quark algorithm coin, and according to CoinWarz, currently capable of producing ~200-400 dollars a day in mining revenue. There must be a decimal place or two missing in the calculation! I mined over 1000 MUE in the space of time that I spent on YiiMP, but they are currenly worth about 15 satoshis each. I will be earning about .00015 BTC on the exchange when my MUE coins mature, I think. Not so sure! Every block that I hit, I got all the coins save the pool share. Somehow, CoinWarz led me to believe that I'd be earning 100X more, at least. I solved ~40 blocks at 40 MUE coins per block. Because YiiMP is more or less a hobby/experimental pool, it was as if I was solomining. There was no other miner to share the blocks with me, just the 1% pool share to deduct. If I could only find the right coin... --scryptr Volume bro... Mining a dead coin. More hashrate you have, the bigger coins you have to mine. Putting aside volume, large amounts of hashrate instantly destroy the difficulty on small coins making them pointless to mine.
|
I buy private Nvidia miners. Send information and/or inquiries to my PM box.
|
|
|
ldp5500
|
|
September 06, 2015, 07:48:54 AM |
|
Optimization anything about GTX950? I have three cards in wait.
|
|
|
|
kama
|
|
September 06, 2015, 08:17:54 AM |
|
Optimization anything about GTX950? I have three cards in wait. optimizations are base specific(i mean compute 5.2 or 5.0)not card specific
|
|
|
|
go6ooo1212
Legendary
Offline
Activity: 1512
Merit: 1000
quarkchain.io
|
|
September 06, 2015, 09:58:22 AM |
|
Optimization anything about GTX950? I have three cards in wait. Yesterday I've been testing one of these babies. It reaches 8.77MH on quark with overclock...
|
|
|
|
skunk
|
|
September 06, 2015, 11:06:43 AM |
|
master's whirlpoolx seems broken, it segfaults under linux...
|
|
|
|
ldp5500
|
|
September 06, 2015, 11:41:35 AM |
|
Optimization anything about GTX950? I have three cards in wait. Yesterday I've been testing one of these babies. It reaches 8.77MH on quark with overclock... That is really good, which version to use? THX.
|
|
|
|
go6ooo1212
Legendary
Offline
Activity: 1512
Merit: 1000
quarkchain.io
|
|
September 06, 2015, 12:52:34 PM |
|
Optimization anything about GTX950? I have three cards in wait. Yesterday I've been testing one of these babies. It reaches 8.77MH on quark with overclock... That is really good, which version to use? THX. r.65 - the last
|
|
|
|
|