Bitcoin Forum
November 08, 2024, 08:07:46 PM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 »
  Print  
Author Topic: Custom RAM Timings for GPU's with GDDR5 - DOWNLOAD LINKS - UPDATED  (Read 155640 times)
kilo17 (OP)
Legendary
*
Offline Offline

Activity: 980
Merit: 1001

aka "whocares"


View Profile
March 26, 2017, 05:31:28 AM
 #441


It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Bitcoin Will Only Succeed If The Community That Supports It Gets Support - Support Home Miners & Mining
rednoW
Legendary
*
Offline Offline

Activity: 1510
Merit: 1003


View Profile
March 26, 2017, 06:33:54 AM
Last edit: March 26, 2017, 07:31:46 AM by rednoW
 #442


My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 26, 2017, 08:08:02 AM
 #443


It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
kilo17 (OP)
Legendary
*
Offline Offline

Activity: 980
Merit: 1001

aka "whocares"


View Profile
March 26, 2017, 10:02:41 AM
 #444


It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.

You are very welcome.  BTW- take a look a AIDA - you can print off the actual voltage at each DPM for each individual GPU - it can give you all kinds of great info in addition to that

Bitcoin Will Only Succeed If The Community That Supports It Gets Support - Support Home Miners & Mining
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 26, 2017, 11:24:23 AM
 #445


It really depends on the DPM voltage set. Some gpus have higher DPM state without global vddc offset, for example my last MSI Armor 4G 470 has 1.185V DPM7 voltage, while RX 480 Sapphire with global offset of 04 has default 1.0V+0.25V. Open wattool and it will tell you all Smiley


That is a little misleading.  Yes WattTool tells you that info but the voltage at DPM7 (or any other for that matter) is determined by the ASIC Quality and is not "set" by the manufacturer.  Yes, some Sapphire Bios' have the +.25mV offset but if you flip the bios switch the other side usually does not.  

Regardless, the ASIC quality determines the DPM and is not directly set by the manufacturer and it does not matter which manufacturer.  That is easy to see by simply setting a manual value that is to low in place of the code in PBE and then running the GPU at full blast.  If you set the DPM7 to 0.9 it would still pull 1.25 using your example and not 0.9.

Last point, WattTool tells some info but if you want a tool to profile it is much better to use AIDA


Thank you, that sounds absolutely correct! I didn't know( I confess ) that DPM voltage is determined by ASIC quality(much to learn, obviously). That sure clarifies all.

You are very welcome.  BTW- take a look a AIDA - you can print off the actual voltage at each DPM for each individual GPU - it can give you all kinds of great info in addition to that
It will be a bit difficult Smiley I rarely use windows and if I do it is for 1 GPu only. I will
later see on my hynix the output of aida indeed.

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 01:41:15 PM
 #446

Tried 1200/2100 on my Samsung 480 - 30.47MH/s; doesn't matter if I use 4.10 or 4.11rc3.

Thanks for the info; saved me the trouble of testing 4.11.

As for the speed, is that with your Polaris-tuned kernel or stock sgminer-gm 5.5.5?  I still can't get over 29.3 at 1212/2100 (same speed as 1167/2100).
The card is capable of >30, since I get ~30.1 from Claymore 8.1 (1167/2100).
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 01:48:54 PM
 #447


My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15

My strap is for K4G4.  Likely different mode registers than your 8G Samsung.
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 01:51:44 PM
 #448

Tried 1200/2100 on my Samsung 480 - 30.47MH/s; doesn't matter if I use 4.10 or 4.11rc3.

Thanks for the info; saved me the trouble of testing 4.11.

As for the speed, is that with your Polaris-tuned kernel or stock sgminer-gm 5.5.5?  I still can't get over 29.3 at 1212/2100 (same speed as 1167/2100).
The card is capable of >30, since I get ~30.1 from Claymore 8.1 (1167/2100).


Does over 30 on both. Timings are pretty good, though.

I'm using worksize 192, gpu-threads 2, and xi1024.  You using anything different?
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 04:18:05 PM
 #449

I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.
laik2
Sr. Member
****
Offline Offline

Activity: 652
Merit: 266



View Profile WWW
March 26, 2017, 04:45:53 PM
 #450

I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.

Here's the correct structure, it was stated earlier in niko2004x's post.
RX
Quote
struct {
        uint32_t trp_wra : 7 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad0  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;

R9
Quote
struct {
        uint32_t trp_wra : 6 ;
        uint32_t m1Pad0  : 2 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;

Miners Mining Platform [ MMP OS ] - https://app.mmpos.eu/
rednoW
Legendary
*
Offline Offline

Activity: 1510
Merit: 1003


View Profile
March 26, 2017, 05:02:02 PM
 #451


My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15

My strap is for K4G4.  Likely different mode registers than your 8G Samsung.


thanks for the answer!
Any hint for 8G samsung? My card has no mem cooling so I need some good timings for 2050-2100 range.
Eliovp
Legendary
*
Offline Offline

Activity: 1050
Merit: 1294

Huh?


View Profile WWW
March 26, 2017, 05:17:19 PM
 #452


My Samsung strap was posted before, but here it is again:
777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 010103139962C3617

With 2100 mclk I'm getting 29-29.1 with sgminer-gm 5.5.5, kernel 4.10.5 & AMDGPU 16.60.

doesn't work on my power color red dragon rx480 8gm samsung mem.
windows driver crash when start ETH, both 2050 and 2100 mem clock.
works good on 2100 clock with 1750 (1625?) strap from original rom
777000000000000022339D00CECD593980551111AE8A84080048C6006C0014206A8900A00200312 0140F262B88252F15

My strap is for K4G4.  Likely different mode registers than your 8G Samsung.


thanks for the answer!
Any hint for 8G samsung? My card has no mem cooling so I need some good timings for 2050-2100 range.

There is some serious difference in Samsung Modlues, if you're lucky you can have one like i have.

one that easily breaks 34Mh on ETH.. i'm still surprised myself to be honest..

GPU0 - 1220 / 2150
GPU1 - 1180 / 2100
GPU2 - 1220 / 2100
GPU3 - 1220 / 2100
GPU4 - 1220 / 2100
GPU5 - 1220 / 2100



yeah.. strange indeed  Roll Eyes

nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 05:34:26 PM
 #453

I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.

Here's the correct structure, it was stated earlier in niko2004x's post.
RX
Quote
struct {
        uint32_t trp_wra : 7 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad0  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;

R9
Quote
struct {
        uint32_t trp_wra : 6 ;
        uint32_t m1Pad0  : 2 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;


I think mine is better.  I think there is really 7 bits for RP_RDA, and 6 for tRP, but none of the existing straps use the highest bit.

nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 05:42:55 PM
 #454

Any simple suggestions for a Hynix strap:
999000000000000022559D0010DE5B4480551312B74C450A00400601750414206A8900A00200312 010112D34A42A3816

It's based on the 1625 strap, since the 1500 strap wasn't stable for me beyond 1875.  The 1625 is stable at 1950, but slow (barely over 24Mh), and that's after RRD=5, zeroing FAW and 32AW, ACTRD=16.  I also dropped tRP from 22 to 21, but that didn't make a difference in speed.  From work with other straps, I don't think tightening RAS any more will help.  Any quick tips?

nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 05:49:40 PM
 #455

There is some serious difference in Samsung Modlues, if you're lucky you can have one like i have.

one that easily breaks 34Mh on ETH.. i'm still surprised myself to be honest..

GPU0 - 1220 / 2150
GPU1 - 1180 / 2100
GPU2 - 1220 / 2100
GPU3 - 1220 / 2100
GPU4 - 1220 / 2100
GPU5 - 1220 / 2100



yeah.. strange indeed  Roll Eyes

It's impossible to do >34Mh with a 2150 mem clock.  With 2250, maybe, but certainly not at 2150.
lexele
Full Member
***
Offline Offline

Activity: 190
Merit: 100


View Profile
March 26, 2017, 06:25:49 PM
 #456

It's based on the 1625 strap, since the 1500 strap wasn't stable for me beyond 1875.
That's very strange, I have several rx with hynix and the 1500 strap is the most stable at least up to 2050.
Truthchanter
Sr. Member
****
Offline Offline

Activity: 689
Merit: 253


View Profile
March 26, 2017, 06:37:52 PM
Last edit: March 26, 2017, 07:55:54 PM by Truthchanter
 #457

Are there any publicly available custom memory strap timings for the rx 400 series? (samsung, elpida, or hynix)

Here's the Samsung strap I'm currently working on:
777000000000000022CC1C0010625C49D0571016B50BD50900400700140514207A8900A00300000 0191131399D2C3617

It's the 1750 strap with RRD=5, FAW&32AW=0.  It's stable at 2100 on my Sapphire Rx470.  The previous custom strap I tried was based on the 1625 strap, and I would start getting a lot of errors over 2000.  I just started working on it today, so there's more tweaking to do (like trying a lower tRC for RAS).

Nicely done Smiley
Lets see the final results.

Small adjustments to your timing based on my experience : 777000000000000022CC1C00CE615C45C0571016B30CD50900400700140514207A8900A00300000 0151031399D2C3617


Thanks! Nice improvement over the regular 1750 strap. Now the question is... will this 1750 samsung strap work with 480s with samsung (and then both 4gb and 8gb?)

If you have one of the first 4G samsung batches, it will work and even improve even more. (because you can unlock those to 8G)
If you have a newer batch of those 4G samsung's, it will most probably run but won't run stable.

Greetings

Why won't it be stable? Would this only be stable for reference cards?

EDIT: after reading for a bit I saw something about it was only for K4G4 (something like this?) which im guessing means theres different samsung memory modules
nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 07:43:47 PM
 #458

It's based on the 1625 strap, since the 1500 strap wasn't stable for me beyond 1875.
That's very strange, I have several rx with hynix and the 1500 strap is the most stable at least up to 2050.

He's on Samsung - that's why - Hynix/Elpida, usually 1500 is fine.

No, that's my Asus Strix 470 with Hynix.  My Sapphire ref has Samsung.
And I figured out my problem was 32AW.  I thought I had zero'd it but it was still at 8.  With 32AW=0 now it gets ~27.1 instead of ~24.2 at 1950.
999000000000000022559D0010DE5B4480551312B74C450A00400600750414206A8900A00200312 010112D34A42A3816
niko2004x
Member
**
Offline Offline

Activity: 126
Merit: 10


View Profile
March 26, 2017, 07:49:22 PM
 #459

I just figured out ohgodadecode has the wrong structure for MISC_TIMING.  Here's a Hynix 1625 strap that shows it: 999000000000000022559D0010DE7B4480551312B78C450A004C0601750414206A8900A00200312 018112D34A42A3816

Here's how I think it should be:
Code:
typedef struct _SEQ_MISC_TIMING_FORMAT
{
    uint32_t TRP_WRA : 7;
    uint32_t TRP_RDA : 7;
    uint32_t TRP : 6;
    uint32_t TRFC : 9;
    uint32_t Pad0 : 3;
} SEQ_MISC_TIMING_FORMAT;

I think the way it was defined is probably correct for R9 series cards, not Rx.

Here's the correct structure, it was stated earlier in niko2004x's post.
RX
Quote
struct {
        uint32_t trp_wra : 7 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad0  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;

R9
Quote
struct {
        uint32_t trp_wra : 6 ;
        uint32_t m1Pad0  : 2 ;
        uint32_t trp_rda : 6 ;
        uint32_t m1Pad1  : 1 ;
        uint32_t trp     : 5 ;
        uint32_t trfc    : 9 ;
        uint32_t m1Pad2  : 3 ;
} mc_seq_misc_timing;


Nice to know we all arrived to the same structures.


I think mine is better.  I think there is really 7 bits for RP_RDA, and 6 for tRP, but none of the existing straps use the highest bit.


Easy to test.
Set RP_RDA and/or TRP to 0 up to the highest bit (set it to 1) for some high mclk strap. Did not check myself if it is 'crash and burn' or working.
About existing straps you are right.
For RX series maximum values in existings straps are RP_RDA=033, TRP=029 in cards with Micron MT51J256M32HF and Elpida EDW4032BABG.


nerdralph
Sr. Member
****
Offline Offline

Activity: 588
Merit: 251


View Profile
March 26, 2017, 07:52:15 PM
 #460

It's based on the 1625 strap, since the 1500 strap wasn't stable for me beyond 1875.
That's very strange, I have several rx with hynix and the 1500 strap is the most stable at least up to 2050.

Maybe your cards have better cooling for the RAM?  This is an Asus Strix.  After tuning the 1625 strap, it was stable at ~27.2Mh with a 1950 mem clock.  Now I'm testing 2000.
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [23] 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!