Bitcoin Forum
November 18, 2024, 08:12:52 PM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 [13] 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 ... 65 »
  Print  
Author Topic: AMD Mem Tweak XL - Read/modify timings/pp/straps on the fly  (Read 58893 times)
GKumaran
Member
**
Offline Offline

Activity: 204
Merit: 10


View Profile
April 15, 2019, 04:59:18 AM
 #241

Vega 64, aircooled, samsung:

  • Highest Hashrate impact : lower RC (good cards : 44, lower cards 45) - too low leads to invalid shares
  • RP has high impact on stabilty, but low impact on hashrate (preferred 12 as average for stability) -- too low leads to gpu crash withing minutes
  • Second highest HR impact : lower RFC (248) - too low crashes cards
  • Having RRDS and RRDL as same value helps in stability
  • Better to leave CL unimpacted in my tests, it is pretty much linked to all values.
  • There are some hidden param that are not exposed by the tool, so it is advised that if you are not gonna change a value, dont use that param to set the default value, ignore the parameter
  • Normal memory overclock plays a huge part in this tuning. 1107mhz is the highest you can reach without breaking the SOC barrier. You must have a stable non invalid shares throwing OC b4 appling the timing.
  • My 6 cards timing and OC:
    • 1. 1107 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Valid Shares : Highest OC, Timing
    • 2. 1107 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Valid Shares : Highest OC, Timing
    • 3. 1050 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest Timing, Invalid Shares if 1107 OC - 1 invalid per 10 mins
    • 4. 1050 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest Timing, Invalid Shares if 1107 OC - 1 invalid per 10 mins
    • 5. 1107 : --rcdrd 12 --rcdwr 5 --rc 45 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest OC, Invalid Shares if --rc 44 - 1 invalid per 4 hours
    • 6. 1107 : --rcdrd 12 --rcdwr 5 --rc 45 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest OC, Invalid Shares if --rc 44 - 1 invalid per 3 hours

My testing have been limited to only few params, yet to fully test the other params.
ingyenfrag
Newbie
*
Offline Offline

Activity: 84
Merit: 0


View Profile
April 15, 2019, 06:47:15 AM
 #242

Still running. There is no problem even on the lower quality silicon GPU's (yet).  Ref Vega 56@64 bios, samsung HBM.

This is my lighter timing, hopefully stable:
--CL 19 --RAS 30 --RCDRD 12 --RCDWR 6 --RC 44 --RP 13 --RRDS 5 --RRDL 5 --RTP 4 --FAW 18 --CWL 6 --WTRS 4 --WTRL 9 --WR 15 --WRRD 1 --RDWR 18 --REF 17000 --RFC 248

please test it.
latest 19.4.1 drivers. win10
vega64 lc ~1360/1100@875 51.4mhs with claymore 12. But hbm temp is hot 70C and one weaker vega gives sometimes red incorrect shares

22.3 kh/s trtl, testing for pool speed and hw errors

Power Consumption at HW info ?
150 for eth and 135 for trtl

I'm testing it on my both 6 and 5 card rigs.
On the 6 card rig power consumption went up from 1155W to 1205W for roughly 10% hashrate gain (these are the worse silicon GPU-s, 3 of those running on 0.85V, 1 on 0.8375, last 2 on 0.825V).
Unfortunately currently there is no watt meter on the 5 card rig.
I will update about the stability.

What happen with the timings when you reboot?
If it stays after reboot, how do you set the timings back to the default values?

Zorg33
Jr. Member
*
Offline Offline

Activity: 144
Merit: 2


View Profile
April 15, 2019, 07:00:06 AM
 #243

Still running. There is no problem even on the lower quality silicon GPU's (yet).  Ref Vega 56@64 bios, samsung HBM.

noice!

 Wink
Miner Addict
Newbie
*
Offline Offline

Activity: 31
Merit: 0


View Profile
April 15, 2019, 07:05:31 AM
 #244

Still running. There is no problem even on the lower quality silicon GPU's (yet).  Ref Vega 56@64 bios, samsung HBM.

This is my lighter timing, hopefully stable:
--CL 19 --RAS 30 --RCDRD 12 --RCDWR 6 --RC 44 --RP 13 --RRDS 5 --RRDL 5 --RTP 4 --FAW 18 --CWL 6 --WTRS 4 --WTRL 9 --WR 15 --WRRD 1 --RDWR 18 --REF 17000 --RFC 248

please test it.
latest 19.4.1 drivers. win10
vega64 lc ~1360/1100@875 51.4mhs with claymore 12. But hbm temp is hot 70C and one weaker vega gives sometimes red incorrect shares

22.3 kh/s trtl, testing for pool speed and hw errors

Power Consumption at HW info ?
150 for eth and 135 for trtl

I'm testing it on my both 6 and 5 card rigs.
On the 6 card rig power consumption went up from 1155W to 1205W for roughly 10% hashrate gain (these are the worse silicon GPU-s, 3 of those running on 0.85V, 1 on 0.8375, last 2 on 0.825V).
Unfortunately currently there is no watt meter on the 5 card rig.
I will update about the stability.

What happen with the timings when you reboot?
If it stays after reboot, how do you set the timings back to the default values?


Overdriventool : reset and apply
Miner Addict
Newbie
*
Offline Offline

Activity: 31
Merit: 0


View Profile
April 15, 2019, 07:18:37 AM
 #245

Vega 64, aircooled, samsung:

  • Highest Hashrate impact : lower RC (good cards : 44, lower cards 45) - too low leads to invalid shares
  • RP has high impact on stabilty, but low impact on hashrate (preferred 12 as average for stability) -- too low leads to gpu crash withing minutes
  • Second highest HR impact : lower RFC (248) - too low crashes cards
  • Having RRDS and RRDL as same value helps in stability
  • Better to leave CL unimpacted in my tests, it is pretty much linked to all values.
  • There are some hidden param that are not exposed by the tool, so it is advised that if you are not gonna change a value, dont use that param to set the default value, ignore the parameter
  • Normal memory overclock plays a huge part in this tuning. 1107mhz is the highest you can reach without breaking the SOC barrier. You must have a stable non invalid shares throwing OC b4 appling the timing.
  • My 6 cards timing and OC:
    • 1. 1107 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Valid Shares : Highest OC, Timing
    • 2. 1107 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Valid Shares : Highest OC, Timing
    • 3. 1050 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest Timing, Invalid Shares if 1107 OC - 1 invalid per 10 mins
    • 4. 1050 : --rcdrd 12 --rcdwr 5 --rc 44 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest Timing, Invalid Shares if 1107 OC - 1 invalid per 10 mins
    • 5. 1107 : --rcdrd 12 --rcdwr 5 --rc 45 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest OC, Invalid Shares if --rc 44 - 1 invalid per 4 hours
    • 6. 1107 : --rcdrd 12 --rcdwr 5 --rc 45 --rp 12 --rrds 3 --rrdl 3 --rfc 248 : Highest OC, Invalid Shares if --rc 44 - 1 invalid per 3 hours

My testing have been limited to only few params, yet to fully test the other params.

ETH and CNR ?
pbfarmer
Member
**
Offline Offline

Activity: 340
Merit: 29


View Profile
April 15, 2019, 09:56:02 AM
Last edit: April 16, 2019, 10:59:49 PM by pbfarmer
Merited by rednoW (1)
 #246

First - quick results for cn-trtl, w/ efficiency-focused settings, using timings mostly borrowing from others here, w/ some minor tweaks:

Vega 64 air, ubuntu 18.04 + amdgpu-pro 18.50, TRM 0.4.3 (L18+18), 852 cclock (p0)/1107 mclock/818mv, power readings at the wall

stock timings:
--CL 20 --RAS 33 --RCDRD 16 --RCDWR 10 --RC 47 --RP 14 --RRDS 4 --RRDL 6 --RFC 260 (--REF 3900)
18.5 kh/s @ 135w (137 h/w)

modded timings 1
--CL 19 --RAS 28 --RCDRD 12 --RCDWR 5 --RC 44 --RP 12 --RRDS 3 --RRDL 3 --RFC 248
19.75 kh/s @ 137w (144 h/w)

modded timings 2
same as above, plus --REF 15600
20.71 kh/s @ 137w (151 h/w)


Second - notes on power...  I don't see any appreciable power differences - nor would I expect to.  Clocks and voltages are untouched, we simply have a bit more data being transferred.  Even the 2w difference I'm reporting here is conservative - taking natural fluctuations in my readings into account, my actual increase could be closer to <= 1w.  People seeing large power increases (at least on vega 64) seem to have something else going on.


Last - some conjecture / educated guessing re: THAT --REF THO!!!  I'm assuming --REF is the refresh frequency, in nanoseconds, and unlike most timings, a higher value (meaning less refreshing) is better.  Refreshes steal bandwidth, and AMD seems to have gone majorly conservative (aggressive?) on this, probably due to the super high temps of the HBM during normal/gaming use.  As leakage increases w/ temps, more refreshes would be required when running your GPU/HBM at high clocks/voltages.  Since (efficient) miners tend to run cooler, the crazy high default refresh rate is really unnecessary.  I found 4x the default to be around where returns quickly diminish, at least at my clocks - i can get maybe another 50 h/s (turtle) going 4.5x.  HOWEVER - if you run super aggressive for max h/r, or just aren't effectively cooled in general, you may want to dial this back, or you may start seeing mem errors / bad shares from corrupted data due to insufficient refreshing / leakage.

vmozara
Member
**
Offline Offline

Activity: 190
Merit: 59


View Profile
April 15, 2019, 11:36:59 AM
 #247

That is some crazy efficiency. I guess that is not even possible on windows, right? Maybe it is time for me to stop being scared and tried alternative operating systems.
Anwil
Newbie
*
Offline Offline

Activity: 13
Merit: 5


View Profile
April 15, 2019, 12:09:36 PM
Last edit: April 15, 2019, 12:49:12 PM by Anwil
 #248

Rx Vega 56 hynix memory  
1250/875mv   916/925mv
tartl cn 20.12 - 20.20  kh/s  L24+24
CN/r   2160   1450/900
60 min no errors.
WinAMDTweak.exe --rcdrd 19 --rcdwr 4 --rc 35 --rp 13 --rrds 4 --rrdl 5 --rfc 148 --REF 15600
Cool!!!
GKumaran
Member
**
Offline Offline

Activity: 204
Merit: 10


View Profile
April 15, 2019, 12:25:28 PM
 #249

That is some crazy efficiency. I guess that is not even possible on windows, right? Maybe it is time for me to stop being scared and tried alternative operating systems.

pbfarmer is our resident magician, we strive to reach his god tier levels of efficiency  Grin Grin
Was waiting for his take on the timing to replicate  Cheesy

First - quick results for cn-trtl, w/ efficiency-focused settings, using timings mostly borrowing from others here, w/ some minor tweaks:

Vega 64 air, ubuntu 18.04 + amdgpu-pro 18.50, TRM 0.4.3 (L18+18), 852 cclock (p0)/1107 mclock/818mv, power readings at the wall

stock timings:
--CL 20 --RAS 33 --RCDRD 16 --RCDWR 10 --RC 47 --RP 14 --RRDS 4 --RRDL 6 --RFC 260 (--REF 3900)
18.5 kh/s @ 135w (137 h/w)

modded timings 1
--CL 19 --RAS 28 --RCDRD 12 --RCDWR 5 --RC 44 --RP 12 --RRDS 3 --RRDL 3 --RFC 248
19.75 kh/s @ 137w (144 h/w)

modded timings 2
same as above, plus --REF 15600
20.71 kh/s @ 137w (151 h/w)


Ill test ur CL, RAS and REF timing. Thx pbfarmer.

Results:
Cards : Vega 64 (aircooled, reference, samsung)
Miner : SRBMiner 1.8.3
Algo : Cryptonight litev7
Clocks : 1408/1107/875 (ODT)
Actual : 1354/1107/850 (HWinfo)

Stock                                                                                                                                                           : 4530
Todxx               --rp 10 --rc 44 --rfc 300 --rrd 3 --rcdrd 12 --rcdwr 5                                                                : 4800
My current        --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5                                                   : 4865 (7.4% inc over stock)
pbfarmer          --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5 --CL 19 --RAS 28 --REF 15600       : 4980 (9.9% inc over stock)

In all seriousness, was w8ing for ur numbers the most and u didnt disappoint  Grin Grin
GKumaran
Member
**
Offline Offline

Activity: 204
Merit: 10


View Profile
April 15, 2019, 02:38:33 PM
Last edit: April 15, 2019, 04:39:24 PM by GKumaran
 #250


Results:
Cards : Vega 64 (aircooled, reference, samsung)
Miner : SRBMiner 1.8.3
Algo : Cryptonight litev7
Clocks : 1408/1107/875 (ODT)
Actual : 1354/1107/850 (HWinfo)

Stock                                                                                                                                                           : 4530
Todxx               --rp 10 --rc 44 --rfc 300 --rrd 3 --rcdrd 12 --rcdwr 5                                                                : 4800
My current        --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5                                                   : 4865 (7.4% inc over stock)
pbfarmer          --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5 --CL 19 --RAS 28 --REF 15600       : 4980 (9.9% inc over stock)


Results:
Cards : Vega 64 (aircooled, reference, samsung)
Platform : Windows 10, 18.6.1
Miner : PheonixMiner 4.2c
Algo : Ethash
Clocks : 1216/1107/875 (ODT)
Actual : 1187/1107/850 (HWinfo)

Stock                                                                                                                                                           : 44.75 mh/s
Current Timing  --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5 --CL 19 --RAS 28 --REF 15600       : 50.12 (12% inc over stock)

Finally broke the Ethash 50mh/s barrier in windows  Grin Grin Smiley Smiley
dragonmike
Hero Member
*****
Offline Offline

Activity: 1274
Merit: 556



View Profile
April 15, 2019, 03:00:31 PM
 #251

I'm trying to take it slowly now.

So far I've been doing really well with these timings on CN-Turtle, i.e. improved hashrate on miner AND pool reflects it exactly (124.5 Kh/s miner hash, 124.5 Kh/s pool hash, according to TRM):

--CL 16 --RAS 30 --RCDRD 14 --RP 12 --RC 44

It's not the highest hashrate I've had using other timings posted here earlier, but the hash increase is 100% consistent on the pool, as opposed to what I experienced with too aggressive timings previously (I saw up to 21.3 Kh/s per Vega but the effective pool hash was A LOT lower).

I'll keep tweaking, but if anyone needs a basis to get started with, this works very well on my ref Samsung 56@64 rig.
hammuh
Jr. Member
*
Offline Offline

Activity: 41
Merit: 1


View Profile
April 15, 2019, 04:18:02 PM
Last edit: April 15, 2019, 07:36:07 PM by hammuh
 #252

Rx Vega 56 hynix memory  
1250/875mv   916/925mv
tartl cn 20.12 - 20.20  kh/s  L24+24
CN/r   2160   1450/900
60 min no errors.
WinAMDTweak.exe --rcdrd 19 --rcdwr 4 --rc 35 --rp 13 --rrds 4 --rrdl 5 --rfc 148 --REF 15600
Cool!!!

Looks good so far. About 1 kh/s boost to turtle. Didn't work with all my hynix tho. Need to figure out what to change to get to work with weaker card.

Edit: after longer time most gpu crash or have hashrate drop
reaper7881
Newbie
*
Offline Offline

Activity: 27
Merit: 0


View Profile
April 15, 2019, 04:22:00 PM
Last edit: April 15, 2019, 05:46:02 PM by reaper7881
 #253

First - quick results for cn-trtl, w/ efficiency-focused settings, using timings mostly borrowing from others here, w/ some minor tweaks:

Vega 64 air, ubuntu 18.04 + amdgpu-pro 18.50, TRM 0.4.3 (L18+18), 852 cclock (p0)/1107 mclock/818mv, power readings at the wall

stock timings:
--CL 20 --RAS 33 --RCDRD 16 --RCDWR 10 --RC 47 --RP 14 --RRDS 4 --RRDL 6 --RFC 260 (--REF 3900)
18.5 kh/s @ 135w (137 h/w)

modded timings 1
--CL 19 --RAS 28 --RCDRD 12 --RCDWR 5 --RC 44 --RP 12 --RRDS 3 --RRDL 3 --RFC 248
19.75 kh/s @ 137w (144 h/w)

modded timings 2
same as above, plus --REF 15600
20.71 kh/s @ 137w (151 h/w)


What bios are you using . I can get 20.1khs per card but have to run 1250 core 1100mem using 145w


Edit

Managed to get it to 21.8 khs @ 125w in hwinfo pulling 1060w from wall with 6 vegas approx 130khs trt
Zorg33
Jr. Member
*
Offline Offline

Activity: 144
Merit: 2


View Profile
April 15, 2019, 07:19:37 PM
 #254

Anyone analyzed the built-in HBM tuning in the driver?

I just started looking at it and seems it modifies RRD values and probably some others too.
Might get some good ideas from there....
clousian
Newbie
*
Offline Offline

Activity: 33
Merit: 0


View Profile
April 15, 2019, 07:46:46 PM
 #255

Every combo I've tried on Polaris (even translating modded bios strap values) results in a crash to blue screen for me. At this stage, I'd say this tool is unusable for Polaris cards.
pbfarmer
Member
**
Offline Offline

Activity: 340
Merit: 29


View Profile
April 15, 2019, 07:56:31 PM
 #256

That is some crazy efficiency. I guess that is not even possible on windows, right? Maybe it is time for me to stop being scared and tried alternative operating systems.

I tested the base settings w/o modded timings on win10pro w/ 18.x drivers in the past, and saw the same results.  There may be differences w/ 19.x - I haven't tried yet.

That being said, a linux setup is pretty simple and equally performant for many algos these days.
nars28
Newbie
*
Offline Offline

Activity: 25
Merit: 0


View Profile
April 15, 2019, 08:08:13 PM
 #257

Every combo I've tried on Polaris (even translating modded bios strap values) results in a crash to blue screen for me. At this stage, I'd say this tool is unusable for Polaris cards.

I have noticed that if you throw out --cl value it doesn't crash when applying string. So it must be some bug in applying --CL. Also the timings are reset to default when i run miner. If i apply them while mining - Bluescreen.
pbfarmer
Member
**
Offline Offline

Activity: 340
Merit: 29


View Profile
April 15, 2019, 08:11:54 PM
 #258


Ill test ur CL, RAS and REF timing. Thx pbfarmer.

Results:
Cards : Vega 64 (aircooled, reference, samsung)
Miner : SRBMiner 1.8.3
Algo : Cryptonight litev7
Clocks : 1408/1107/875 (ODT)
Actual : 1354/1107/850 (HWinfo)

Stock                                                                                                                                                           : 4530
Todxx               --rp 10 --rc 44 --rfc 300 --rrd 3 --rcdrd 12 --rcdwr 5                                                                : 4800
My current        --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5                                                   : 4865 (7.4% inc over stock)
pbfarmer          --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5 --CL 19 --RAS 28 --REF 15600       : 4980 (9.9% inc over stock)

In all seriousness, was w8ing for ur numbers the most and u didnt disappoint  Grin Grin


Most of my settings are just using values which seem to be gathering consensus here - from people such as yourself Smiley  I honestly didn't put much time into tinkering, other than adjusting a couple 'derived' values based on what seemed more appropriate mathematically.  I'd like to look into more, esp CL/FAW/WR...

I mainly wanted to point out the outsized effect of increasing REF, and put some thoughts to what it does, and why it's ok to go from 3.9K to say 15-17K.
clousian
Newbie
*
Offline Offline

Activity: 33
Merit: 0


View Profile
April 15, 2019, 08:13:32 PM
 #259

Every combo I've tried on Polaris (even translating modded bios strap values) results in a crash to blue screen for me. At this stage, I'd say this tool is unusable for Polaris cards.

I have noticed that if you throw out --cl value it doesn't crash when applying string. But without it the hashrate is even lower than stock. So it must be some bug in applying --CL.

Interesting. I guess my point still stands then. Unusable currently on Polaris.
pbfarmer
Member
**
Offline Offline

Activity: 340
Merit: 29


View Profile
April 15, 2019, 08:17:39 PM
 #260


Results:
Cards : Vega 64 (aircooled, reference, samsung)
Platform : Windows 10, 18.6.1
Miner : PheonixMiner 4.2c
Algo : Ethash
Clocks : 1216/1107/875 (ODT)
Actual : 1187/1107/850 (HWinfo)

Stock                                                                                                                                                           : 44.75 mh/s
Current Timing  --rp 12 --rc 44 --rfc 250 --rrds 3 --rrdl 3 --rcdrd 12 --rcdwr 5 --CL 19 --RAS 28 --REF 15600       : 50.12 (12% inc over stock)

Finally broke the Ethash 50mh/s barrier in windows  Grin Grin Smiley Smiley

Ethash is even better than cn-trtl wrt being able to drop core clock on vega.  Where you may lose 5+% h/r on trtl by going to cclock p0 (852mhz), I saw zero h/r loss on ethash when doing the same, at least w/ stock timings.  Should allow you to get your voltage down significantly - closer to 800mv.
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 [13] 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 ... 65 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!