Bitcoin Forum
Author Topic: CCminer(SP-MOD) Modded GPU kernels.  (Read 2347498 times)
sp_ (OP)
September 13, 2019, 07:15:06 AM
Last edit: September 13, 2019, 07:45:47 AM by sp_
 #24481

Yeah, no PTX, that's what I was saying.
==> RandomX

So to make a fast RandomX miner on NVIDIA, you can convert the RandomX code to PTX before execution. (Create a new PTX kernel for each block.)
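
As a rough illustration of the idea (generate a fresh kernel from each block's random program), here is a minimal Python sketch that turns a random instruction stream into PTX-style text. The four-operation instruction set, the register names (%rd0 to %rd7) and the function names are all made up for illustration. This is not the real RandomX instruction set, and a usable kernel would also need register declarations, memory operations and a JIT step through the CUDA driver.

```python
import random

# Hypothetical toy instruction set (an assumption, NOT the real RandomX ISA).
# Each entry maps a toy opcode to one PTX integer instruction.
PTX_TEMPLATES = {
    "ADD": "add.u64 %rd{dst}, %rd{dst}, %rd{src};",
    "SUB": "sub.u64 %rd{dst}, %rd{dst}, %rd{src};",
    "MUL": "mul.lo.u64 %rd{dst}, %rd{dst}, %rd{src};",
    "XOR": "xor.b64 %rd{dst}, %rd{dst}, %rd{src};",
}


def random_program(seed, length, num_regs=8):
    """Build a deterministic pseudo-random program from a per-block seed."""
    rng = random.Random(seed)
    opcodes = sorted(PTX_TEMPLATES)
    return [
        (rng.choice(opcodes), rng.randrange(num_regs), rng.randrange(num_regs))
        for _ in range(length)
    ]


def to_ptx_body(program):
    """Translate (opcode, dst, src) tuples into straight-line PTX text."""
    return "\n".join(
        PTX_TEMPLATES[op].format(dst=dst, src=src) for op, dst, src in program
    )


if __name__ == "__main__":
    # A new seed (for example derived from the block) gives a new kernel body.
    print(to_ptx_body(random_program(seed=1234, length=8)))
```

The point of the sketch is only that the instruction decode happens once, on the host, instead of once per executed instruction on the GPU.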

Without optimizations, the NVIDIA cards lose to the CPU.

RandomX benchmarks:

https://bitcointalk.org/index.php?topic=5176747.0

GPU                             Cryptonight-R    RandomX
AMD
Vega 64                         2200 H/s         1225 H/s
RX 480/580                      960-1000 H/s     400-410 H/s
RX 560 4GB (1400/2200 MHz)      495 H/s          260 H/s
NVIDIA/EVGA
RTX 2080 Ti (1915/13600 MHz)    960-1000 H/s     400-410 H/s
GTX 1080 Ti (2037/11800 MHz)    927 H/s          1122 H/s
GTX 1070 Ti (1900/7600 MHz)     625 H/s          769 H/s

For CPUs:
CPU                                           Cryptonight-R    RandomX
AMD 3900X (4.25GHZ ALL CORE, 3600MHZ RAM)     1335 H/s         13330 H/s
RYZEN 3700X                                   1018 H/s         6853 H/s
RYZEN 5 3600                                  803 H/s          6580 H/s
INTEL I9 9900K                                630 H/s          2102 H/s
2X XEON E5 2670 V2                            930 H/s          5815 H/s
INTEL I7 7700K                                350 H/s          2100 H/s
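
To put the CPU versus GPU gap in one number, the short Python sketch below computes the RandomX to Cryptonight-R ratio for a few rows of the benchmark figures quoted in this post. Only rows with a single value per column are used; the dictionary and function names are just for illustration.

```python
# (Cryptonight-R H/s, RandomX H/s), copied from the benchmark figures in this post.
BENCHMARKS = {
    "Vega 64": (2200, 1225),
    "GTX 1080 Ti": (927, 1122),
    "GTX 1070 Ti": (625, 769),
    "AMD 3900X": (1335, 13330),
    "RYZEN 3700X": (1018, 6853),
    "INTEL I9 9900K": (630, 2102),
}


def randomx_ratio(name):
    """How many times faster (or slower) RandomX runs than Cryptonight-R."""
    cryptonight_r, randomx = BENCHMARKS[name]
    return randomx / cryptonight_r


if __name__ == "__main__":
    for name in BENCHMARKS:
        print(f"{name}: RandomX is {randomx_ratio(name):.2f}x Cryptonight-R")
```

On these figures the CPUs gain several times over their Cryptonight-R rate, while the listed GPUs stay close to it or drop below it.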


Team Black Miner (ETHB3 ETH ETC VTC KAWPOW FIROPOW ZILLIQA + dual mining + triple mining): https://github.com/sp-hash/TeamBlackMiner
joblo
September 13, 2019, 04:33:06 PM
 #24482

The point with PTX is that it's a unified language for all NVIDIA GPU architectures.

The point is that it's only Nvidia GPU architectures. No ASIC, no FPGA, no Radeon, no CPU.

AKA JayDDee, cpuminer-opt developer. https://github.com/JayDDee/cpuminer-opt
https://bitcointalk.org/index.php?topic=5226770.msg53865575#msg53865575
BTC: 12tdvfF7KmAsihBXQXynT6E6th2c2pByTT,
sp_ (OP)
September 13, 2019, 08:24:57 PM
 #24483

The point with PTX is that it's a unified language for all NVIDIA GPU architectures.
The point is that it's only Nvidia GPU architectures. No ASIC, no FPGA, no Radeon, no CPU.

Doesn't need to be PTX. If you run on NVIDIA hardware you convert the random stream of instructions to PTX. RandomX could be very profitable on NVIDIA hardware with a proper implementation...

joblo
September 13, 2019, 10:37:05 PM
 #24484

Doesn't need to be PTX. If you run on NVIDIA hardware you convert the random stream of instructions to PTX. RandomX could be very profitable on NVIDIA hardware with a proper implementation...

Precisely. You can build an Nvidia-only proof of concept, but a real product will need
its own pseudo language that can be compiled to PTX/CUDA, OpenCL, and x86 native instructions,
producing identical functionality. The language would have to be complex enough (in the CISC sense)
that the FPGA can't decode it with a simple table lookup. That's a hell of a lot of work.

sp_ (OP)
September 14, 2019, 05:05:48 AM
 #24485

The language would have to be complex enough (in the CISC sense) that the FPGA can't decode it with a simple table lookup. That's a hell of a lot of work.

The FPGA has limits on memory access and multipliers. Let's say the FPGA can do 32 multiplications and 32 memory accesses per cycle; then you might be able to run 32 instructions per cycle at 500 MHz.


RandomX on the GPU doesn't need any memory access to fetch instructions because the code is compiled, and you can run with 1024 threads at 2000 MHz.

So the GPU can do 1024 instructions per cycle at 2000 MHz.
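
For reference, the arithmetic behind this comparison can be written out as a small Python sketch. It takes the post's figures at face value (32 instructions per cycle at 500 MHz for the FPGA, 1024 per cycle at 2000 MHz for the GPU) and assumes every cycle is fully used, which is a best-case assumption rather than a measurement. The function name is made up for illustration.

```python
def instructions_per_second(instructions_per_cycle, clock_mhz):
    """Peak instruction rate, assuming every cycle is fully used."""
    return instructions_per_cycle * clock_mhz * 1_000_000


# Figures taken from the post above; both are hypothetical best cases.
fpga_rate = instructions_per_second(32, 500)
gpu_rate = instructions_per_second(1024, 2000)

if __name__ == "__main__":
    print(f"FPGA: {fpga_rate:.3e} instructions/s")
    print(f"GPU:  {gpu_rate:.3e} instructions/s")
    print(f"GPU/FPGA ratio: {gpu_rate / fpga_rate:.0f}x")
```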

pallas
September 14, 2019, 05:47:38 AM
 #24486

The FPGA doesn't make N multiplications per cycle. It does N hashes per cycle, with N integer > 0 or, in the case of complex algorithms, 1/N.

sp_ (OP)
September 14, 2019, 05:53:53 AM
Last edit: September 14, 2019, 06:15:07 AM by sp_
 #24487

The FPGA doesn't make N multiplications per cycle. It does N hashes per cycle, with N integer > 0 or, in the case of complex algorithms, 1/N.

Yes, but in RandomX the FPGA needs to do a memory read per cycle to determine the instruction to be executed, so the N hashes per cycle doesn't apply. The new limit is then N instructions, where N is limited by the number of memory accesses the chip can do per cycle. In older FPGA designs it was normal to have ASIC multipliers you could use to speed up multiplications (e.g. Altera Cyclone IV). The multiplication could also be done in code.

pallas
September 14, 2019, 06:11:31 AM
 #24488

True, but it's also true that you can fill the FPGA with custom-made cores, each executing RandomX instructions. FPGAs are plenty flexible, much more than GPUs; they just take much more time to optimise.

sp_ (OP)
September 14, 2019, 06:30:33 AM
 #24489

With a compiled kernel, the GPU can execute 15000 RandomX instructions in 15 cycles per hash at 2000 MHz.
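
One reading of the 15-cycle figure is 15000 instructions spread over the 1024 instructions per cycle claimed earlier in the thread. The Python sketch below works that out; it assumes perfect parallelism across all 1024 threads, which the post does not justify, and the function names are made up for illustration.

```python
import math


def cycles_per_hash(instructions_per_hash, instructions_per_cycle):
    """Whole cycles needed if the work divides evenly across all lanes."""
    return math.ceil(instructions_per_hash / instructions_per_cycle)


def seconds_per_hash(cycles, clock_mhz):
    """Wall time for the given number of cycles at the given clock."""
    return cycles / (clock_mhz * 1_000_000)


cycles = cycles_per_hash(15000, 1024)

if __name__ == "__main__":
    print(cycles, "cycles per hash")
    print(seconds_per_hash(cycles, 2000), "seconds per hash at 2000 MHz")
```

This is an idealized peak; the measured GPU figures in the benchmark list earlier in the thread are on the order of a thousand hashes per second.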

Kodaman
September 24, 2019, 07:07:09 PM
 #24490

Maybe this place has the info. There are rumors that x16rv2 already has ASICs.
Any feedback?
joblo
September 24, 2019, 07:52:41 PM
 #24491

If you don't produce a source it usually means you're starting the rumour.

Kodaman
September 24, 2019, 07:58:20 PM
 #24492

If you don't produce a source it usually means you're starting the rumour.

iBeLink in California is the ASIC provider. Is the source good enough? ;)
Kodaman
September 24, 2019, 08:06:44 PM
 #24493

The X25X algo is the one and only GPU-only algo, not only for all Nvidia GPUs but also for cards that have 6 GB or less, because it doesn't require memory-hard operations.
All the Nvidia cards from the 1050 Ti 2 GB up to the 1080 Ti can mine it without fear of ASICs and FPGAs around.
T-Rex has the algo optimised to the maximum, which is why there are no private miners or new faster miners for X25X.
Today is the good information day lol
joblo
September 24, 2019, 08:43:04 PM
 #24494

If you don't produce a source it usually means you're starting the rumour.

iBeLink in California is the ASIC provider. Is the source good enough? ;)

If iBeLink has announced it, it's not a rumour, it's fact. Where did the rumour originate?
Is it your own speculation?

There's nothing in x16rv2 that makes it more technically difficult than x16r to implement on ASIC or FPGA.
It's probably only a matter of time and demand.


sp_ (OP)
September 27, 2019, 08:16:25 AM
Last edit: September 27, 2019, 08:28:26 AM by sp_
 #24495

T-Rex has the algo optimised to the maximum, which is why there are no private miners or new faster miners for ....

T-Rex is slow and not profitable. Better to mine Beamv2 or Grin.

Kodaman
September 27, 2019, 09:24:39 AM
 #24496

T-Rex has the algo optimised to the maximum, which is why there are no private miners or new faster miners for ....

T-Rex is slow and not profitable. Better to mine Beamv2 or Grin.
What about people with 1050 Tis, 1060 3 GBs, in fact any Nvidia cards with less than 8 GB? What should we mine, which algo and which miner?
sp_ (OP)
September 27, 2019, 09:31:23 AM
Last edit: September 27, 2019, 09:49:34 AM by sp_
 #24497

Beam can be mined with 3gb cards. gminer.  (windows 7,8.1 or linux)
Grin29 can be mined with 4/6gb cards. gminer (windows 7,8.1 or linux)
for 2gb cards monero  / randomx?

Kodaman
September 27, 2019, 03:32:19 PM
 #24498

Beam can be mined with 3gb cards. gminer.  (windows 7,8.1 or linux)
Grin29 can be mined with 4/6gb cards. gminer (windows 7,8.1 or linux)
for 2gb cards monero  / randomx?
Cool thanks for the tip but what about the windows 10 users?
scryptr
September 28, 2019, 06:34:52 PM
Last edit: September 28, 2019, 09:22:51 PM by scryptr
 #24499

Beam can be mined with 3gb cards. gminer.  (windows 7,8.1 or linux)
Grin29 can be mined with 4/6gb cards. gminer (windows 7,8.1 or linux)
for 2gb cards monero  / randomx?

RANDOMX IS A GOOD QUESTION--

I was just looking at SChernykh's github.  He has coded both CUDA and OpenCL versions for benchmarking RandomX on GPUs.  I compiled and ran the CUDA version on my 1070ti (8GB) rig, and got about 669H/s RandomX per single 1070ti.  I'll try on my 750ti (2GB) rig in a little while.  Both are Linux rigs.

Maybe you could plug the RandomX algo into SuprMiner.  I noticed that none of the commercial, closed source CCminer clones came out with an x16rv2 version until after you modded your SuprMiner source with it.       --scryptr

TIPS:  BTC - 1Fs4uZ6a9ABYBTaHGUfqcwCQmeBRxkKRQT    DASH - XrK81tW31SLsVvZ2WX9VhTjpT6GXJPLdbQ
          SCRYPTR'S NOTEBOOK: https://bitcointalk.org/index.php?topic=5035515.msg46035530#msg46035530
          GITHUB: "github.com/scryptr"  MERIT is appreciated, also.  Thanks!
pallas
September 28, 2019, 07:01:33 PM
 #24500

The difference between x16rv2 and x16r is just Tiger, which is a pretty basic algorithm, already available as open-source CUDA code in my m7 miner years ago, and in the commercial miner software supporting my x22i and x25x.
