Bitcoin Forum
September 26, 2017, 09:41:24 PM *
News: Latest stable version of Bitcoin Core: 0.15.0.1  [Torrent]. (New!)
 
   Home   Help Search Donate Login Register  
Pages: « 1 ... 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 [77] 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 ... 1145 »
  Print  
Author Topic: [ANN] cudaMiner & ccMiner CUDA based mining applications [Windows/Linux/MacOSX]  (Read 3287720 times)
69charger
Full Member
***
Offline Offline

Activity: 173


View Profile
December 17, 2013, 06:05:16 PM
 #1521

What seems apparent already is that there is no 20-30% speed gain. Weird.

On a non overclocked 780Ti I was going from ~440 kHash to 487 kHash  (a 10% improvement)
On a GT 750M I was going from 55 kHash/s to 59 kHash/s (no texture read caching implemented so far)

Well a 10% increase is better than a kick in the teeth Smiley

Just a question, would it not be worth trying to contact the author and ask him if he can take a quick look and see if there are any other improvements he could make for us?

We're chatting.  I suspect that what it's going to come down to is:
  - OK speedups on some cards (5-10%) using Cuda 5.0
  - Decent speedups on some other cards (20%) using Cuda 5.5
  - Really good speedups (80%) on some low-cuda-core-count mobile platforms using Cuda 5.5.

It's a little tricky, though - there's something in my code that really likes Cuda 5.5, and the previous code targets Cuda 5.0 in order to be more widely usable.  Likely nothing that can't be solved, but it may take some work.

Thanks to both Christian and Dave for all the hard work!

Anxiously awaiting your masterpiece on two GTX 660's  Grin
          ▄█████▄
        ▄█████████▄
      ▄████▀   ▀████▄
    ▄████▀   ▄ ▄█▀████▄
  ▄████▀   ▄███▀   ▀████▄
▄████▀   ▄███▀   ▄   ▀████▄
█████   ███▀   ▄███   █████
▀████▄   ▀██▄▄███▀   ▄████▀
  ▀████▄   ▀███▀   ▄████▀
    ▀████▄       ▄████▀
      ▀████▄   ▄████▀
        ▀███  ████▀
          ▀█▄███▀
.
|
.
|
          ▄█████▄
        ▄█████████▄
      ▄████▀   ▀████▄
    ▄████▀   ▄ ▄█▀████▄
  ▄████▀   ▄███▀   ▀████▄
▄████▀   ▄███▀   ▄   ▀████▄
█████   ███▀   ▄███   █████
▀████▄   ▀██▄▄███▀   ▄████▀
  ▀████▄   ▀███▀   ▄████▀
    ▀████▄       ▄████▀
      ▀████▄   ▄████▀
        ▀███  ████▀
          ▀█▄███▀
unthy
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction. Advertise here.
1506462084
Hero Member
*
Offline Offline

Posts: 1506462084

View Profile Personal Message (Offline)

Ignore
1506462084
Reply with quote  #2

1506462084
Report to moderator
juggs
Newbie
*
Offline Offline

Activity: 28


View Profile
December 18, 2013, 02:29:51 AM
 #1522

I know ~nothing~ about coding this stuff, so forgive me if this is a dumb question.

How hard would it be to have cudaminer display the currently mined scrypt's difficulty, the GPU utilisation, temp and fan speed %ge?

I only ask as I used to get to get temp and fan speed info through nVidia's X Server Settings application (on linux). Now I've installed the latest nVidia drivers and CUDA from source, moved my desktop display to the motherboard onboard graphics and run cudaminer with the non-interactive flag (works fine) that route no longer works to query the information. I've looked around but not been able to find any way to query the activity of the card.

I probably missed something obvious - so all suggestions welcome Smiley
juggs
Newbie
*
Offline Offline

Activity: 28


View Profile
December 18, 2013, 02:43:51 AM
 #1523

And having looked for hours, having posted the above I find a way that should query those things.

/usr/bin/nvidia-smi -a

Sadly it is mostly N/A with my card (GTS 250)
Code:
==============NVSMI LOG==============

Timestamp                           : Wed Dec 18 02:40:11 2013
Driver Version                      : 319.37

Attached GPUs                       : 1
GPU 0000:02:00.0
    Product Name                    : GeForce GTS 250
    Display Mode                    : N/A
    Display Active                  : N/A
    Persistence Mode                : Disabled
    Accounting Mode                 : N/A
    Accounting Mode Buffer Size     : N/A
    Driver Model
        Current                     : N/A
        Pending                     : N/A
    Serial Number                   : N/A
    GPU UUID                        : GPU-83625be0-08b8-2a8c-44c8-82f560e2d9b7
    VBIOS Version                   : 62.92.7E.00.00
    Inforom Version
        Image Version               : N/A
        OEM Object                  : N/A
        ECC Object                  : N/A
        Power Management Object     : N/A
    GPU Operation Mode
        Current                     : N/A
        Pending                     : N/A
    PCI
        Bus                         : 0x02
        Device                      : 0x00
        Domain                      : 0x0000
        Device Id                   : 0x061510DE
        Bus Id                      : 0000:02:00.0
        Sub System Id               : 0x110319DA
        GPU Link Info
            PCIe Generation
                Max                 : N/A
                Current             : N/A
            Link Width
                Max                 : N/A
                Current             : N/A
    Fan Speed                       : 43 %
    Performance State               : N/A
    Clocks Throttle Reasons         : N/A
    Memory Usage
        Total                       : 511 MB
        Used                        : 235 MB
        Free                        : 276 MB
    Compute Mode                    : Default
    Utilization
        Gpu                         : N/A
        Memory                      : N/A
    Ecc Mode
        Current                     : N/A
        Pending                     : N/A
    ECC Errors
        Volatile
            Single Bit           
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Total               : N/A
            Double Bit           
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Total               : N/A
        Aggregate
            Single Bit           
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Total               : N/A
            Double Bit           
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Total               : N/A
    Retired Pages
        Single Bit ECC              : N/A
        Double Bit ECC              : N/A
        Pending                     : N/A
    Temperature
        Gpu                         : 64 C
    Power Readings
        Power Management            : N/A
        Power Draw                  : N/A
        Power Limit                 : N/A
        Default Power Limit         : N/A
        Enforced Power Limit        : N/A
        Min Power Limit             : N/A
        Max Power Limit             : N/A
    Clocks
        Graphics                    : N/A
        SM                          : N/A
        Memory                      : N/A
    Applications Clocks
        Graphics                    : N/A
        Memory                      : N/A
    Default Applications Clocks
        Graphics                    : N/A
        Memory                      : N/A
    Max Clocks
        Graphics                    : N/A
        SM                          : N/A
        Memory                      : N/A
    Compute Processes               : N/A

That's with cudaminer running - all I have to go on is the temp and fan speed. Doesn't seem to be working that hard really - it used to get hotter and higher fans than that gaming on it.

Any pointer welcome Smiley
69charger
Full Member
***
Offline Offline

Activity: 173


View Profile
December 18, 2013, 04:57:44 AM
 #1524

Does the massive update on GitHub mean a release is close HuhShocked Shocked
cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 05:34:53 AM
 #1525

Does the massive update on GitHub mean a release is close HuhShocked Shocked

I present the 2013-12-18 release with all new Kepler, Titan kernels. Huge thanks to David Andersen who came up with a more efficient way to do scrypt mining on the Kepler architecture.

We now use CUDA 5.5, which has increased driver requirements. Be sure to use a recent nVidia driver.

When mining on Kepler devices, please autotune again and report your findings.

Christian

cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 05:45:17 AM
 #1526

Here are some configs that work for me (there may exist better ones, of course)

GTX 780Ti: -l T15x24    550 kHash/s  (106% TDP, +100 core clock, +200 mem clock)
GTX 660Ti: -l K14x8      250 kHash/s
GT 640 (GK208 based): -l T9x20     92 kHash/s

all of this is significantly above what previous cudaminer releases would achieve.

Careful, your cards may run hotter than before.
blackraven1425
Member
**
Offline Offline

Activity: 98


View Profile
December 18, 2013, 05:49:53 AM
 #1527

Any advice on x64 vs x86 with the new version?

EDIT: With a 670 GTX and the same settings as before, I bumped from 160ish khash to 190 khash. Awesome work, cbuchner! Also, thanks to David Anderson!

Settings: -i 1 -l K90x2 -C 1 -H 1 -m 1
cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 05:51:04 AM
 #1528

Any advice on x64 vs x86 with the new version?

I stick to x86, as generally the register pressure is lower (CUDA kernels run faster) and the CPU is mostly idle, anyways.
cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 05:54:05 AM
 #1529

And having looked for hours, having posted the above I find a way that should query those things.

/usr/bin/nvidia-smi -a

Sadly it is mostly N/A with my card (GTS 250)

https://github.com/CFSworks/nvml_fix

fixed this for me. I had to install gcc 4.4 to compile this hack.

Christian
69charger
Full Member
***
Offline Offline

Activity: 173


View Profile
December 18, 2013, 05:54:23 AM
 #1530

Went from 152 per card to 212!!! Actually uses the max power limit now. Cards are hotter but only 66c, before they were 60c.

K5x12 on two MSI GTX 660 TF3/OC cards.

THANK YOU!!!!

edit: Bumped up fans to 56% and I am back under 60c. Desktop performance is as if I don't have cudaminer running at all! Amazing update!!
cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 06:04:10 AM
 #1531

Settings: -i 1 -l K90x2 -C 1 -H 1 -m 1

You should really autotune again. The x2 is probably inefficient. Also the -C (texture cache) argument is currently ignored with David's new kernels. I will try to integrate this feature again if it brings an additional speed benefit.
y3804
Sr. Member
****
Offline Offline

Activity: 336


View Profile
December 18, 2013, 06:07:16 AM
 #1532

No love for fermi cards? Smiley
cbuchner1
Hero Member
*****
Offline Offline

Activity: 742


View Profile
December 18, 2013, 06:08:28 AM
 #1533

No love for fermi cards? Smiley

Fermi has been loved enough.
vosovich
Newbie
*
Offline Offline

Activity: 28


View Profile
December 18, 2013, 06:09:21 AM
 #1534

My hashrate improved by about 8% on a Fermi kernel, so I'm not complaining.
blackraven1425
Member
**
Offline Offline

Activity: 98


View Profile
December 18, 2013, 06:12:06 AM
 #1535

Settings: -i 1 -l K90x2 -C 1 -H 1 -m 1

You should really autotune again. The x2 is probably inefficient. Also the -C (texture cache) argument is currently ignored with David's new kernels. I will try to integrate this feature again if it brings an additional speed benefit.


Yeah, it was just a quick check before I went into a repeated autotune frenzy.

Speaking of which, there's some really good news on that front. With K42x5, I'm now at a whopping 230khash, still on interactive mode! Incredible job here!

Side question, do you have a Bitcoin addy? I don't mine LTC.

EDIT: 28x8 is at 235khash. Still have a few more to check out, but this is probably where I'm going to top out.

EDIT2: The oddball 14x24 is just a hair faster at 236.
69charger
Full Member
***
Offline Offline

Activity: 173


View Profile
December 18, 2013, 06:32:52 AM
 #1536

Seriously, what is the deal with the desktop. My cards are cranking like they've never before and desktop is more responsive than the old -i 1 mode even though I am set at -i 1?
blackraven1425
Member
**
Offline Offline

Activity: 98


View Profile
December 18, 2013, 06:45:13 AM
 #1537

Seriously, what is the deal with the desktop. My cards are cranking like they've never before and desktop is more responsive than the old -i 1 mode even though I am set at -i 1?

I'm noticing this too. There's slightly less stutter on video (had Pawn Stars on this whole time while testing configs). Not the very slight nearly constant stutter I've always seen, but the larger occasional stutters seem to be gone.
mmihai1978
Sr. Member
****
Offline Offline

Activity: 364


View Profile
December 18, 2013, 07:01:10 AM
 #1538

i see massive jump on gtx 760 from 152 kh/s to 250 kh/s.

thanks

-H 1 -d 0 -i 1 -l auto -C 2 -m 1
Notanon
Sr. Member
****
Offline Offline

Activity: 385


Pastor of Muppets


View Profile
December 18, 2013, 07:02:51 AM
 #1539

Sitting at around 178kH on autotune with the new version with my GTX 660, compared to 140 with optimised settings on the previous version and around 125 with autotune on the previous version. Going to put in the previous settings and see if it makes a noticeable difference.

Edit: Sitting around 196kH with optimised settings, so it's approximately a 40% jump in output compared to the previous version. Very pleased with the performance. Cheesy

AMHash
ASICMINERROCKMINER ● Purchase from: AMHash (20Th/s min) ● Havelock (1Gh/s min)
Cloud-mining contracts: 0.0012 BTC per Gh ● Maintenance fee: $0.001551 per Gh per day ● Upto 6% Christmas Bonus

kernels10
Sr. Member
****
Offline Offline

Activity: 408


ded


View Profile
December 18, 2013, 08:26:16 AM
 #1540

My 660Ti was getting 215 (+/- 5) khash/s  before.
Started new version and was getting 265 (+/- 5).
I left my computer for ~40 mins and now its at a steady 285 (+/- 5) !!!

Great Work!
Pages: « 1 ... 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 [77] 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 ... 1145 »
  Print  
 
Jump to:  

Sponsored by , a Bitcoin-accepting VPN.
Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!