Bitcoin Forum
April 30, 2024, 06:55:23 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 ... 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 [291] 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 ... 417 »
  Print  
Author Topic: [OS] nvOC easy-to-use Linux Nvidia Mining  (Read 417954 times)
Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 27, 2017, 05:49:44 PM
 #5801

Why my rig stops after 15 hours ?
I have 4X geeforce windforce gtx 1070

Temp: GPU0: 70C GPU1: 67C GPU2: 59C GPU3: 69C
GPU0: 443 Sol/s GPU1: 445 Sol/s GPU2: 453 Sol/s GPU3: 445 Sol/s
Total speed: 1786 Sol/s
+-----+-------------+--------------+
| GPU | Power usage |  Efficiency  |
+-----+-------------+--------------+
|  0  |    129W     |  3.43 Sol/W  |
|  1  |    135W     |  3.30 Sol/W  |
|  2  |    134W     |  3.38 Sol/W  |
|  3  |    133W     |  3.35 Sol/W  |
+-----+-------------+--------------+


I have a corsair 1200W
MB: ASUS PRIME Z270-A
now power wall is only 615W

power limit at 140W each one
__CORE_OVERCLOCK_1=150
MEMORY_OVERCLOCK_1=580







Not enough info
Check your wdog logs, may be utilization dropped, internet had a hiccup, or ....  


Mon Nov 27 00:30:56 CET 2017 - reboot in 10 seconds
Mon Nov 27 00:46:48 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 01:32:44 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 04:47:39 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 05:03:25 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 05:40:57 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 05:55:32 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 06:54:42 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 08:31:25 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 10:01:26 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 11:03:27 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 13:13:01 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 13:52:24 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:01:38 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:01:48 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:01:58 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:02:08 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:02:18 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 15:02:28 CET 2017 - Utilization is too low: restart 3main
Mon Nov 27 15:03:41 CET 2017 - Utilization is too low: restart 3main
Mon Nov 27 15:04:53 CET 2017 - Utilization is too low: restart 3main
Mon Nov 27 15:06:05 CET 2017 - Utilization is too low: restart 3main
Mon Nov 27 15:07:20 CET 2017 - Utilization is too low: restart 3main
Mon Nov 27 15:08:32 CET 2017 - Utilization is too low: reviving did not work so restarting system in 10 seconds

Mon Nov 27 16:57:48 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 17:11:23 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 17:45:45 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 18:14:25 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures
Mon Nov 27 18:17:36 CET 2017 - Low Utilization Detected: 3main will reinit if there are 6 consecutive failures

Ok, we see from that that the watchdog is detecting low utilization and restarting the miner and after 5 miner restarts, reboots the host. That is the normal behavior of the watchdog script. Now we have to find out why the miner is not mining. Can you check /home/m1/screenlog.0 for errors from the miner?
1714503323
Hero Member
*
Offline Offline

Posts: 1714503323

View Profile Personal Message (Offline)

Ignore
1714503323
Reply with quote  #2

1714503323
Report to moderator
The Bitcoin software, network, and concept is called "Bitcoin" with a capitalized "B". Bitcoin currency units are called "bitcoins" with a lowercase "b" -- this is often abbreviated BTC.
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
bwillyb
Newbie
*
Offline Offline

Activity: 5
Merit: 0


View Profile
November 27, 2017, 06:04:28 PM
 #5802

screenlog seems to be till 20 november
bwillyb
Newbie
*
Offline Offline

Activity: 5
Merit: 0


View Profile
November 27, 2017, 06:28:16 PM
 #5803

ok Found
Total speed: 1811 Sol/s
INFO 15:01:17: GPU0 Accepted share 35ms [A:993, R:0]
INFO: Detected new work: d2d1
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Stratum subscribe timeout

INFO: Target: 0003333333333333...
INFO: Detected new work: cce3
CUDA: Device: 0 GeForce GTX 1070, 8112 MB PCI: 0000:01:00.0
CUDA: Device: 1 GeForce GTX 1070, 8113 MB PCI: 0000:03:00.0
CUDA: Device: 2 GeForce GTX 1070, 8113 MB PCI: 0000:04:00.0
CUDA: Device: 3 GeForce GTX 1070, 8113 MB PCI: 0000:05:00.0
CUDA: Device: 1 Selected solver: 0
CUDA: Device: 0 Selected solver: 0
CUDA: Device: 2 Selected solver: 0
CUDA: Device: 3 Selected solver: 0
INFO: Detected new work: cce4
INFO 15:16:20: GPU3 Accepted share 37ms [A:1, R:0]
INFO 15:16:26: GPU3 Accepted share 37ms [A:2, R:0]
Temp: GPU0 53C GPU1 51C GPU2 49C GPU3 51C
GPU0: 457 Sol/s GPU1: 456 Sol/s GPU2: 464 Sol/s GPU3: 454 Sol/s
Total speed: 1831 Sol/s
papampi
Full Member
***
Offline Offline

Activity: 686
Merit: 140


Linux FOREVER! Resistance is futile!!!


View Profile WWW
November 27, 2017, 06:43:05 PM
 #5804


I tested both scenarios with a small change in your Debug code
Used ewbf and set threshold to 100 to see what happenes

added this at the end before done
Code:
    echo "Debug: JEEP=$JEEP, COUNT=$COUNT, RESTART=$RESTART REBOOTRESET=$REBOOTRESET"

With the original code, wdog resets REBOOTRESET after 5 cycles, with your proposal REBOOTRESET keeps adding up, and wont resets after 5.
So my conclusion is the original code should be correct.  


With the THRESHOLD set to 100, you would almost always have a utilization error so the miner would be restarting and eventually the host. This code is not intended for that situation. In fact, your test proves just the opposite of your conclusion. The REBOOTRESET should continue to increase until such a point as the watchdog detects normal mining operations at which point it will hit the else part part of the if (JEEP=0) and will find that REBOOTRESET is greater than 5 and will then set both REBOOTRESET and RESTART to 0.

REBOOTRESET increases by every cycle (10 seconds), no matter if there is a low util, no miner, ...
Take a look at the code you see its on top, before checking utilization

Edit:
I have only 2 cards on my test rig, so I set threshold to 100 to see whats happening when there is low utilization

But when I read your logic again, that should be it, not the way it is now
will test more later

Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 27, 2017, 06:54:41 PM
 #5805

ok Found
Total speed: 1811 Sol/s
INFO 15:01:17: GPU0 Accepted share 35ms [A:993, R:0]
INFO: Detected new work: d2d1
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Stratum subscribe timeout

INFO: Target: 0003333333333333...
INFO: Detected new work: cce3
CUDA: Device: 0 GeForce GTX 1070, 8112 MB PCI: 0000:01:00.0
CUDA: Device: 1 GeForce GTX 1070, 8113 MB PCI: 0000:03:00.0
CUDA: Device: 2 GeForce GTX 1070, 8113 MB PCI: 0000:04:00.0
CUDA: Device: 3 GeForce GTX 1070, 8113 MB PCI: 0000:05:00.0
CUDA: Device: 1 Selected solver: 0
CUDA: Device: 0 Selected solver: 0
CUDA: Device: 2 Selected solver: 0
CUDA: Device: 3 Selected solver: 0
INFO: Detected new work: cce4
INFO 15:16:20: GPU3 Accepted share 37ms [A:1, R:0]
INFO 15:16:26: GPU3 Accepted share 37ms [A:2, R:0]
Temp: GPU0 53C GPU1 51C GPU2 49C GPU3 51C
GPU0: 457 Sol/s GPU1: 456 Sol/s GPU2: 464 Sol/s GPU3: 454 Sol/s
Total speed: 1831 Sol/s


Well, that appears to be the answer to your original question. Your rig stopped mining after 15 hours because the miner lost connectivity to the pool. Either your rig lost connectivity or the pool was down.
Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 27, 2017, 07:06:43 PM
Last edit: November 27, 2017, 07:29:42 PM by Stubo
 #5806


I tested both scenarios with a small change in your Debug code
Used ewbf and set threshold to 100 to see what happenes

added this at the end before done
Code:
    echo "Debug: JEEP=$JEEP, COUNT=$COUNT, RESTART=$RESTART REBOOTRESET=$REBOOTRESET"

With the original code, wdog resets REBOOTRESET after 5 cycles, with your proposal REBOOTRESET keeps adding up, and wont resets after 5.
So my conclusion is the original code should be correct.  


With the THRESHOLD set to 100, you would almost always have a utilization error so the miner would be restarting and eventually the host. This code is not intended for that situation. In fact, your test proves just the opposite of your conclusion. The REBOOTRESET should continue to increase until such a point as the watchdog detects normal mining operations at which point it will hit the else part part of the if (JEEP=0) and will find that REBOOTRESET is greater than 5 and will then set both REBOOTRESET and RESTART to 0.

REBOOTRESET increases by every cycle (10 seconds), no matter if there is a low util, no miner, ...
Take a look at the code you see its on top, before checking utilization

Edit:
I have only 2 cards on my test rig, so I set threshold to 100 to see whats happening when there is low utilization

But when I read your logic again, that should be it, not the way it is now
will test more later

Yeah, when looking at the code, check the other places where REBOOTRESET is changed. Note it is just after 3main is killed:
Code:
<cut>
         echo ""
         RESTART=$(($RESTART + 1))
         REBOOTRESET=0
         COUNT=$GPU_COUNT
<cut>

So, I think the intent of developer was to find a way to reset variable RESTART to 0 to avoid unnecessary host reboots and the best way to do that is only if you can make it through the main loop without any "below utilization" problems.

Also, another minor bug is that REBOOTRESET is never initialized at the beginning of the script. The first reference to it is when it is incremented by 1 within the main [while] loop. I am not sure why that doesn't throw an error.
bwillyb
Newbie
*
Offline Offline

Activity: 5
Merit: 0


View Profile
November 27, 2017, 07:17:24 PM
 #5807

ok Found
Total speed: 1811 Sol/s
INFO 15:01:17: GPU0 Accepted share 35ms [A:993, R:0]
INFO: Detected new work: d2d1
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Cannot connect to the pool
ERROR: Lost connection with the server.
INFO: Attempt to restore connection.
ERROR: Stratum subscribe timeout

INFO: Target: 0003333333333333...
INFO: Detected new work: cce3
CUDA: Device: 0 GeForce GTX 1070, 8112 MB PCI: 0000:01:00.0
CUDA: Device: 1 GeForce GTX 1070, 8113 MB PCI: 0000:03:00.0
CUDA: Device: 2 GeForce GTX 1070, 8113 MB PCI: 0000:04:00.0
CUDA: Device: 3 GeForce GTX 1070, 8113 MB PCI: 0000:05:00.0
CUDA: Device: 1 Selected solver: 0
CUDA: Device: 0 Selected solver: 0
CUDA: Device: 2 Selected solver: 0
CUDA: Device: 3 Selected solver: 0
INFO: Detected new work: cce4
INFO 15:16:20: GPU3 Accepted share 37ms [A:1, R:0]
INFO 15:16:26: GPU3 Accepted share 37ms [A:2, R:0]
Temp: GPU0 53C GPU1 51C GPU2 49C GPU3 51C
GPU0: 457 Sol/s GPU1: 456 Sol/s GPU2: 464 Sol/s GPU3: 454 Sol/s
Total speed: 1831 Sol/s


Well, that appears to be the answer to your original question. Your rig stopped mining after 15 hours because the miner lost connectivity to the pool. Either your rig lost connectivity or the pool was down.

Sure zcash.pro was down at 15:00
Thanks
TheNewEthlite
Newbie
*
Offline Offline

Activity: 8
Merit: 0


View Profile
November 27, 2017, 08:26:21 PM
 #5808

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.
https://imgur.com/a/QL5K8

Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.
Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 27, 2017, 08:30:04 PM
 #5809

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.


Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.


That appears to be a filesystem problem. Are you running nvOC from a USB stick or SSD?
papampi
Full Member
***
Offline Offline

Activity: 686
Merit: 140


Linux FOREVER! Resistance is futile!!!


View Profile WWW
November 27, 2017, 08:31:21 PM
 #5810

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.


Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.


Looks like a USB reached end of it's life
just grab a 30$ SSD and save your self

TheNewEthlite
Newbie
*
Offline Offline

Activity: 8
Merit: 0


View Profile
November 27, 2017, 08:59:08 PM
 #5811

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.
https://imgur.com/a/QL5K8

Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.


That appears to be a filesystem problem. Are you running nvOC from a USB stick or SSD?

Tis a USB, but worked perfectly fine before upgrading to 1.4
CyberGI
Newbie
*
Offline Offline

Activity: 14
Merit: 0


View Profile
November 28, 2017, 12:17:34 AM
 #5812

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.
https://imgur.com/a/QL5K8

Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.


That appears to be a filesystem problem. Are you running nvOC from a USB stick or SSD?

Tis a USB, but worked perfectly fine before upgrading to 1.4

I ran into a similar issue with a pretty high quality USB drive. I extended the partitions using the tool included in nvOS and that bought me about an extra week. Fortunately, that was more time than it took for a new, 60GB SSD to be delivered.
Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 28, 2017, 12:52:19 AM
 #5813

Hello,

got a problem with rig stop mining, doesn't autoreboot, screen is all green with warning box with nondescript characters. Hard reboot required, attached error screen on reboot.


Asus H270-plus
6x 1080ti
intel G3930

Any help would be much appreciated.


That appears to be a filesystem problem. Are you running nvOC from a USB stick or SSD?

Tis a USB, but worked perfectly fine before upgrading to 1.4

I ran into a similar issue with a pretty high quality USB drive. I extended the partitions using the tool included in nvOS and that bought me about an extra week. Fortunately, that was more time than it took for a new, 60GB SSD to be delivered.

I have been using these and have had no issues:

https://www.amazon.com/Silicon-Power-Endurance-Free-download-SP060GBSS3S60S25AE/dp/B01M2UUACN/ref=sr_1_13?s=pc&ie=UTF8&qid=1511829978&sr=1-13&keywords=ssd

or if you are really in a hurry and live in the states in a metro area, you can pay a little bit more for a very similar one and get it that day:

https://www.amazon.com/Silicon-Power-Performance-Free-download-SP060GBSS3S55S25/dp/B00D4AVN3G/ref=sr_1_12?s=pc&ie=UTF8&qid=1511830241&sr=1-12&keywords=ssd

Hope this helps.
Alienbert
Newbie
*
Offline Offline

Activity: 26
Merit: 0


View Profile
November 28, 2017, 12:58:34 AM
 #5814

Hi guys!

I mined Zcash and later ZEN! Now i tried NICE_EQUIHASH - start the terminal - settings loaded - screen terminated

So why i cannot mine on Nicehash? What happens?

Thanks for answer

Nice greets
Stubo
Member
**
Offline Offline

Activity: 224
Merit: 13


View Profile
November 28, 2017, 01:00:10 AM
 #5815

Hi guys!

I mined Zcash and later ZEN! Now i tried NICE_EQUIHASH - start the terminal - settings loaded - screen terminated

So why i cannot mine on Nicehash? What happens?

Thanks for answer

Nice greets

Check screenlog.0 in your home directory for clues.
Alienbert
Newbie
*
Offline Offline

Activity: 26
Merit: 0


View Profile
November 28, 2017, 01:14:30 AM
 #5816

Hi guys!

I mined Zcash and later ZEN! Now i tried NICE_EQUIHASH - start the terminal - settings loaded - screen terminated

So why i cannot mine on Nicehash? What happens?

Thanks for answer

Nice greets

Check screenlog.0 in your home directory for clues.

The miner dont starts at NICE_EQUIHASH, so i have no screens in the screenlog.0
6x1070ethsia
Newbie
*
Offline Offline

Activity: 5
Merit: 0


View Profile
November 28, 2017, 02:00:46 AM
 #5817

I cannot get my hashes for a gtx 1070 to get past 26ish mining eth no matter what I do.

Running nvOC 19 1.4

Is 26 just what linux can do or do I need to keep trying to solve this issue and get higher hashes?
hawkfish007
Hero Member
*****
Offline Offline

Activity: 895
Merit: 504


View Profile
November 28, 2017, 02:48:19 AM
 #5818

I cannot get my hashes for a gtx 1070 to get past 26ish mining eth no matter what I do.

Running nvOC 19 1.4

Is 26 just what linux can do or do I need to keep trying to solve this issue and get higher hashes?

What are your settings? I get 29 MH ETH and 280 MH DCR with PL 125 Core -50 and M 900 from Zotac 1070 mini.

For quality risers, splitters or 133 CFM fans, please visit my eBay listings,
http://www.ebay.com/sch/hawkfish007/m.html?_ipg=50&_sop=12&_rdc=1
CyberGI
Newbie
*
Offline Offline

Activity: 14
Merit: 0


View Profile
November 28, 2017, 03:07:49 AM
Last edit: November 28, 2017, 04:06:35 AM by CyberGI
 #5819

I cannot get my hashes for a gtx 1070 to get past 26ish mining eth no matter what I do.

Running nvOC 19 1.4

Is 26 just what linux can do or do I need to keep trying to solve this issue and get higher hashes?

What are your settings? I get 29 MH ETH and 280 MH DCR with PL 125 Core -50 and M 900 from Zotac 1070 mini.

I'm getting 28.8 on the minis. I was getting 26 with the stock settings. I posted earlier today looking for some better settings. I just started messing with it and ended with PL=125, Core=150, and MEM at 600 on the Mini. I'm running three of them.

Anyone else have recommended settings for a 1070, preferably a mini? And thanks for posting y'all's.

Edit: I am now using your 125/-50/900 values and now each of my three cards are cranking out 30Mh/s of ETH. Thanks for the numbers.
papampi
Full Member
***
Offline Offline

Activity: 686
Merit: 140


Linux FOREVER! Resistance is futile!!!


View Profile WWW
November 28, 2017, 06:53:31 AM
 #5820

I cannot get my hashes for a gtx 1070 to get past 26ish mining eth no matter what I do.

Running nvOC 19 1.4

Is 26 just what linux can do or do I need to keep trying to solve this issue and get higher hashes?

What are your settings? I get 29 MH ETH and 280 MH DCR with PL 125 Core -50 and M 900 from Zotac 1070 mini.

I'm getting 28.8 on the minis. I was getting 26 with the stock settings. I posted earlier today looking for some better settings. I just started messing with it and ended with PL=125, Core=150, and MEM at 600 on the Mini. I'm running three of them.

Anyone else have recommended settings for a 1070, preferably a mini? And thanks for posting y'all's.

Edit: I am now using your 125/-50/900 values and now each of my three cards are cranking out 30Mh/s of ETH. Thanks for the numbers.

With gigabyte 1070 I get
Ethahsh : 30-31 MH/s,  Power 130, CC -200, MC 800
Equihash: 470 Sol/s, Power 125, CC 150, MC 600

Have a look here too : https://bitcointalk.org/index.php?topic=2176936

Pages: « 1 ... 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 [291] 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 ... 417 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!