Bitcoin Forum
November 19, 2024, 03:38:43 PM *
News: Latest Bitcoin Core release: 28.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 [28] 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 »
  Print  
Author Topic: Smartcoin Linux mining administration. [MULTI-MACHINE SUPPORT NOW IN!]  (Read 105020 times)
plantucha
Newbie
*
Offline Offline

Activity: 56
Merit: 0


View Profile WWW
July 17, 2011, 10:56:35 PM
 #541

if I don't use AMDOverdriveCtrl - I get the following message after passing `aticonfig --od-setclocks=985,285 --adapter=all`:

Code:
ERROR - Set clocks failed for Adapter 1 - ATI Radeon HD 5800 Series  
        Please check that input values were valid

So AMDOverdriveCtrl must be doing something...And it's not `aticonfig --od-enable --adapter=all` ... that didn't change anything.

It would be good to know how to script Screen anyway - for general Linux education.

yes, ubuntu an other distros are using catalyst 11.5 -
yes you need AMDOverdriveCtrl if is your driver not up to date.
yes I did same , I was assuming I have 11.6.

just update catalyst to 11.6 it is easy copy/paste job, 10 min max:

http://wiki.cchtml.com/index.php/Main_Page

for ubuntu:
http://wiki.cchtml.com/index.php/Ubuntu_Natty_Installation_Guide

jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 17, 2011, 11:19:48 PM
 #542

Update isn't working for me -- it says the current stable version is 425 (I'm now on r449).  Do I need to completely uninstall and reinstall?  And is there a way to reinstall without wiping out my database?

How did you get to r449 if you're running stable?

You can go into Edit Settings, then change the development branch from 'stable' to 'experimental', after which time experimental updates will be available to you

RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
elrock
Newbie
*
Offline Offline

Activity: 41
Merit: 0


View Profile
July 17, 2011, 11:25:53 PM
 #543

Quote
How did you get to r449 if you're running stable?

You can go into Edit Settings, then change the development branch from 'stable' to 'experimental', after which time experimental updates will be available to you

I don't know, but that's the stable version number that was available for update.  I just updated to the experimental version to get rid of the error messages. Is it possible to revert to the last stable version, or am I stuck with experimental now?
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 17, 2011, 11:32:43 PM
 #544

plantucha -

I'm just testing the cgminer and poclbm support.
So far for me, cgminer has been the slowest of the bunch for me (about 1040 MHash/sec vs about 1150 MHash/sec in phoenix). Here is the launch string that I have been testing with:
Code:
./cgminer -a 4way -t 2 -g 2 -o http://pit.x8s.de:8337 -u <#user#> -p <#pass#> -I 14


Also, I don't get much more than 0.7MHash out of my GPU no matter what I do (though, it is an older P4)

Let me know if you have any suggestions!  I'll update this post once I can confirm that the cgminer support in smartcoin is complete


Maybe because cgminer has implemented
- Dynamic intensity that keeps desktop interactive under load and maximises throughput when deskop idle
so if you fiddling with PC intensity going down

or it can be just Intel/AMD sensitive

but anyway, I'm using CGminer for CPU only, Phoenix for GPUs

 
Code:
smartcoin Management System r442(experimental) Sun Jul 17 16:19:33 EDT 2011
--------------------------------------------------------------------------------
Host: localhost
GPU[0]: Temp: 69.00 load: 99%
GPU[1]: Temp: 70.00 load: 99%
GPU[2]: Temp: 71.00 load: 99%
GPU[3]: Temp: 72.00 load: 99%
CPU Load : 0.88%

Profile: Failover
--------BTCGuild--------
GPU[0]: [207.52 Mhash/sec] [132 Accepted] [1 Rejected] [RPC (+LP)]
GPU[1]: [207.48 Mhash/sec] [113 Accepted] [0 Rejected] [RPC (+LP)]
GPU[2]: [207.58 Mhash/sec] [132 Accepted] [1 Rejected] [RPC (+LP)]
GPU[3]: [207.51 Mhash/sec] [132 Accepted] [1 Rejected] [RPC (+LP)]
CPU:    [20.3 MHash/sec] [8 Accepted] [0 Rejected]
Total : [850.39 MHash/sec] [517 Accepted] [3 Rejected] [.580% Rejected]

Grand Total: [850.39 MHash/sec] [517 Accepted] [3 Rejected] [.580% Rejected]



Please note: for some reason after CPU mining is each GPU 2Mhash/s faster than without CPU mining.


now just tiny things like CPU load is 0.88% for GPU mining.
most likely 100% for CPU mining.

I'm getting CPU temperature from "lm-sensors", some people don't have lm-sensors installed.



also I personally do not understand, why people pushing sqlite.
This will be big project and i can see people using it in mining farms with 50+ computers.
Mysql is perfect solution and for mailing postfix.
Think BIG.
People don't cry for 50MB on your 1TB HDD.
Safety first and what is better an more often maintained than Mysql or Postfix?
Some shady project without security upgrades?


jondecker76 Thank you for your super fast development speed.

_______________________________________________________________________________ __________

P.S.:
I'm thinking about writing some documentation, soon it will be too complex for people who starting from this point.
But maybe after some front page simplification.
For example:
Pool configuration on front page
 - it is already pre-configured, but users will try to do something just because they see it...like jen
I think first page have to be just this:

1) Reboot Computer
2) Kill smartcoin (exit)
3) Disconnect from smartcoin (leave running)
4) Select Profile
5) Configure   
6) Update Smartcoin


and most important info:
Ctrl + a      =  switch to rigs live streaming
Ctrl + a +d =  disconnect from smartcoin (leave running)


Submenu 5.

Mandatory basic Configure:
1) Configure Workers

Advacend configure:
2) Edit Settings
3) Remote computers: (yes/no) , address conf. etc

Profile configure:
3) Configure Profiles
4) Set Profile Failover Order

Pre-configured for basic use (tune for your own hardware):
5) Configure Miners
6) Configure Devices
7) Configure Pools

Now, I can see I have to more think about it. Because I'm not sure if is it much easier or more clear than before.
hmmm
....But at least now you do have idea what I mean. Smiley




Thanks for all the feedback!  I do think that in the future, a new menu layout will need done. This project has gotten quite big and complex. I'll definitely take your suggestions into consideration!

Also, it would be great if you wanted to lead the charge for better documentation (sadly, I hardly have time to develop like I would like to, so updates in documentation have fallen behind). Perhaps starting a wiki entry would be a good starting point?


RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
Rob P.
Member
**
Offline Offline

Activity: 84
Merit: 10


View Profile WWW
July 18, 2011, 12:36:04 AM
 #545

Update isn't working for me -- it says the current stable version is 425 (I'm now on r449).  Do I need to completely uninstall and reinstall?  And is there a way to reinstall without wiping out my database?

How did you get to r449 if you're running stable?

You can go into Edit Settings, then change the development branch from 'stable' to 'experimental', after which time experimental updates will be available to you

Jon, something is up with the update system, see this:

Quote
smartcoin Management System r442(stable) Sun Jul 17 20:34:40 EDT 2011
--------------------------------------------------------------------------------
Getting svn information. Please be patient...
Your current version is r442(stable).
The current experimental version is r451
The current stable version is r425
Are you sure that you wish to perform an update?

You can see I'm running r442(stable), yet it's saying the current stable version is r425.  Said the same thing when I actually updated, but I ended up with r442(stable).

--

If you like what I've written here, consider tipping the messenger:
1GZu4CtHa6ai8iWoWiVFxV5VVoNte4SkoG

If you don't like what I've written, send me a Tip and I'll stop talking.
jl
Newbie
*
Offline Offline

Activity: 10
Merit: 0


View Profile
July 18, 2011, 02:32:53 AM
 #546

plantucha-
Quote
just update catalyst to 11.6 it is easy copy/paste job, 10 min max:
that did the trick - now I don't need to rely on AMDOverdriveCtrl anymore, thanks! Have applied `--od-commitclocks` so my setclocks persists. ty
plantucha
Newbie
*
Offline Offline

Activity: 56
Merit: 0


View Profile WWW
July 18, 2011, 03:51:58 AM
Last edit: July 18, 2011, 04:53:34 AM by plantucha
 #547

Update r450 experimental available
- Failovers can  happen now in the case where you get unusually high rejected shares (over 10%)
- Dead GPU detection - detects locked up GPU (most likely cause is too aggressive of an overclock).
- AutoDonate bug fixed.  I was finally able to reproduce it locally after a user PM'd me some good info.  It only effected a few, and it took a specific combination of start time and donation minutes.  
- The autodonate countdown timer is now displayed while donating again, and its accurate.  I ran it through a battery of tests manually to ensure accuracy (tests included start/stop times within the same hour, start/stop times that crossed over an hour, and start/stop times that crossed over midnight)


elrock - do another update,  you must have ran the  update right inbetween my commits!

Update r450
- Dead GPU detection
I got false detection. 3+ hours stable running and smartcoin shutdown.
10min after I find out about it and just start smartcoin again.
Previous.log says shutdown because too much overclocking and GPU locked.
But GPUs are not locked all GPUs happily mining again without PC restart.

I'm overclocking GPU just about 6% over, my mem  is factory set .
Temperatures 70°C, 70°C, 71°C, 72°C
All fans 100%
And it is tested stable for many days.

Now I'm on r451.

edit: when I wanna set back to factory speed
       GPU 0  was showing error.
       So it was locked.
       Stats showing GPU in work confused me.  Huh Usually after smartcoin restart it shows DOWN.
       MY BAD. Sorry for useless report.  Cry




dishwara
Legendary
*
Offline Offline

Activity: 1855
Merit: 1016



View Profile
July 18, 2011, 10:45:15 AM
 #548

Which is REALLY stable version?


xen82
Newbie
*
Offline Offline

Activity: 22
Merit: 0


View Profile
July 18, 2011, 11:44:16 AM
 #549

I started using smartcoin (r451 ex) recently and ran into an interesting bug: instead of failing over to a backup profile, smartcoin terminates with the following message:

07/18/11 14:05:06       ERROR: It appears that one or more of your devices have locked up.  This is most likely the result of extreme overclocking!
07/18/11 14:05:06              It is recommended that you reduce your overclocking until you regain stability of the system
07/18/11 14:05:06       Killing Miners....

The thing is that it app fails at the very beginning when trying to connect to BTCguild using phoenix r111 while BTCguild is unaccessible. Perhaps this is because the meter doesn't show up until a connection is established? I'm running r451 experimental. Overclocking is not an issue here as everything is pretty much stock, with just my fans cranked up to keep everything as cool as possible.

Love your the work! Smiley
plantucha
Newbie
*
Offline Offline

Activity: 56
Merit: 0


View Profile WWW
July 18, 2011, 12:26:08 PM
 #550

I started using smartcoin (r451 ex) recently and ran into an interesting bug: instead of failing over to a backup profile, smartcoin terminates with the following message:

07/18/11 14:05:06       ERROR: It appears that one or more of your devices have locked up.  This is most likely the result of extreme overclocking!
07/18/11 14:05:06              It is recommended that you reduce your overclocking until you regain stability of the system
07/18/11 14:05:06       Killing Miners....

The thing is that it app fails at the very beginning when trying to connect to BTCguild using phoenix r111 while BTCguild is unaccessible. Perhaps this is because the meter doesn't show up until a connection is established? I'm running r451 experimental. Overclocking is not an issue here as everything is pretty much stock, with just my fans cranked up to keep everything as cool as possible.

Love your the work! Smiley

Yes me too.
around 5am EST   BTCguild has problem with connectivity
=> failover switched to profile 1 and 2

all GPUs was already working on profile2 ( I have seen accepted work, so No GPU locked)
while profile 1 (BTCguild) was trying to connect and some GPUs had work, some GPUs was down.

then Anti-locking kicked in and killed all miners
before total switch to profile 2
and again 5 times after i start manually smartcoin

I have to set profile 2 without Failover to keep mining.

Overclocking is not a problem because my issues before I'm already on stock clock. using phoenix r108 + smartcoin r451ex

We need lower sensitivity overclock anti-lock.
or even better make anti-lock optional.


Fletch
Full Member
***
Offline Offline

Activity: 168
Merit: 100


I'll have a steak sandwich and a... steak sandwich


View Profile
July 18, 2011, 12:52:13 PM
 #551

also I personally do not understand, why people pushing sqlite.
This will be big project and i can see people using it in mining farms with 50+ computers.
Mysql is perfect solution and for mailing postfix.
Think BIG.
Because SQLite is the perfect solution for something of this size (even when using 50+ computers). I've run a discussion forum with hundreds of concurrent users and several hundred thousand posts off SQLite. The issue with SQLite and Smartcoin is solved. No need to drag a bloated project like MySQL into this.

Safety first and what is better an more often maintained than Mysql or Postfix?
Some shady project without security upgrades?
LOL

HashPeak - GPU mining hashrate peak detector
BTC: 1FLETCHvcUKosefrcZCLUQTtvx4WvgnYMC
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 18, 2011, 02:47:15 PM
 #552

Thanks for the reports guys!  I believe that I have fixed the stable update bug (yes, the stable version was actually 425- sorry!)


Update r452(stable) and r453(experimental) available!

r452(stable/experimental):
- Lockup detection is now a lot less sensitive. Lockup detection works by comparing the output (the text on the entire screen) of the actual miner between each iteration.  If its exactly the same, then a counter is incremented. The counter is reset each time the output looks different between iterations.  It used to have to see 10 consecutive iterations to figure that the card associated with that instance was locked up.  This turned out to be a bit too sensitive, as some cards run so stable that even the MHash reading doesn't change often.  Also, while changing profiles (automatically or manually), there is a small amount of time where the miner screens don't change at all while it starts up.  This value is now set to 50 iterations, which is probably a bit of an overshot (over 5 consecutive minutes where the miner screen doesn't change one bit), but at least it won't take people's miners offline on a false lockup.

- The stable update bug is fixed. It appears that the bug was introduced when the breakpoints were added.  We will see with newer updates if the stable and experimental versions split again like they are supposed to over the next few updates.


r453(experimental):
- Failover detection made a little less sensitive.  10 iterations of no hashing is now needed to flag a profile as down (it was previously 8 iterations).
- This update was also made so that we can test that its only available to the experimental branch, and the stable branch should stay at r452





Future updates will have custom settings for the failover detection and deadlock detection.

RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
tigereye
Member
**
Offline Offline

Activity: 79
Merit: 10



View Profile
July 18, 2011, 04:56:04 PM
 #553

tigereye

I just committed a change regarding the /tmp directory, perhaps it will help.  smartcoin is meant to be installed and run as user. Here is what I would do:

1) Under user, make sure you clear the data directory:
Code:
rm -r ~/.smartcoin

2) Clear out any temporary data:
Code:
sudo rm /tmp/smartcoin*

3) Follow the install stuff again.  Remove the old smartcoin directory and start again.

4) During the detection of the AMD/ATI SDK detection, there is a bug on some systems which shows two paths on one line.  Since Linuxcoin is 64 bit, make sure to leave only the 64 bit relevant path!

5) If after starting things appear not to run, detach from smartcoin (option 3 on the control screen), then run
Code:
screen -r miner

This will bring you to the individual  miner instances where you can see directly the error output of phoenix
Jon,

I've reinstalled smartcoin from scratch and re-setup all of my workers.
I was seeing the same symptoms as before (a bunch of "grep" error messages flying across the screen every few seconds, and no miners would stay up)

Then I created a manual profile and selected it, and it works!
I went back to the automatic profile, and it showed the same symptoms.

It seems something's funky with the 'automatic' profile.

Hope this helps you zero in on things. If you need any debug info to help narrow it down, let me know.

If my posts have helped, consider leaving a tip! 1AE5e56ivvaGMJJmLrZoLgiZXPx93CddyA
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 18, 2011, 05:39:03 PM
 #554

tigereye

I just committed a change regarding the /tmp directory, perhaps it will help.  smartcoin is meant to be installed and run as user. Here is what I would do:

1) Under user, make sure you clear the data directory:
Code:
rm -r ~/.smartcoin

2) Clear out any temporary data:
Code:
sudo rm /tmp/smartcoin*

3) Follow the install stuff again.  Remove the old smartcoin directory and start again.

4) During the detection of the AMD/ATI SDK detection, there is a bug on some systems which shows two paths on one line.  Since Linuxcoin is 64 bit, make sure to leave only the 64 bit relevant path!

5) If after starting things appear not to run, detach from smartcoin (option 3 on the control screen), then run
Code:
screen -r miner

This will bring you to the individual  miner instances where you can see directly the error output of phoenix
Jon,

I've reinstalled smartcoin from scratch and re-setup all of my workers.
I was seeing the same symptoms as before (a bunch of "grep" error messages flying across the screen every few seconds, and no miners would stay up)

Then I created a manual profile and selected it, and it works!
I went back to the automatic profile, and it showed the same symptoms.

It seems something's funky with the 'automatic' profile.

Hope this helps you zero in on things. If you need any debug info to help narrow it down, let me know.

Can you double check one thing for me?  Go to Configure Miners->Edit and edit the miner you are using. Make sure to answer "y" (yes) when it askes if it is the default miner. (The automatic profile uses the miner flagged as default - it is possible that one isn't flagged as default, or was never flagged initially)

Also, I'm committing a couple of changes that silence grep errors, as there are times during a profile change where grep is looking for files created by the previous profile (they catch up after an interation though).  Between the two, I think your auto profile should work normally after this.

Also, if after doing the steps listed above, your miners appear not to be working in automatic profile, here is some troubleshooting help:
1) Detach from smartcoin, then run:
Code:
screen -r miner
You should connect to a screen session that has a tab for each instance which you can look at to see if the miners are actually running

2) Look in /tmp/ for smartcoin* files. These are the files created from the individual miner instances. You should make sure that they exist, and you can even look at the files themselves to make sure they contain the miner output.

Let me know what you find!

RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 18, 2011, 05:40:06 PM
 #555

Update r455(experimental) available:
- grep errors are suppressed
- Fixed a small bug that resets the failover counters on smartcoin restart


RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
tigereye
Member
**
Offline Offline

Activity: 79
Merit: 10



View Profile
July 18, 2011, 06:08:37 PM
 #556

Jon, thanks for the quick reply!
Here are the answers to your questions:
  • I only have one miner configured: phoenix. It is configured as the default miner, and the path and command seem fine.
  • There are no files in /tmp/ named Smartcoin* when the problem occurs. Nothing gets created there when "automatic" profile is selected, however files get created there when I use any of my custom profiles
  • When I do a "screen -ls" I do not see any screens for miners when the automatic profile is selected. However, when I choose one of my custom profiles, I do see the 'miner' screen listed
I'm not sure what's up, but the automatic profile seems broken.
I've confirmed these results on two separate linuxcoin 0.2b1 installs.

At first I thought smartcoin was broken entirely, but when I created my own profiles it worked like a charm Smiley

Keep up the great work,

If my posts have helped, consider leaving a tip! 1AE5e56ivvaGMJJmLrZoLgiZXPx93CddyA
krzynek1
Newbie
*
Offline Offline

Activity: 41
Merit: 0


View Profile
July 18, 2011, 06:25:05 PM
 #557

at last not a newbie Wink

JD, i must confirm, that autodonation is functioning like it should

gj = good job
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 18, 2011, 06:31:19 PM
 #558

I can't recreate the issue here locally - I'm a bit stumped..  Is there anything in the ~/.smartcoin/smartcoin.log files that may give a hint of the problem?
I do have one other puzzling report (in my PM) where the permissions of /var/run/screen get messed up and they have to chmod 777 /var/run/screen to work properly (though this makes entirely no sense at all to me) - perhaps this is the same issue?  (though I'm not certain it is, as your problem is only automatic profile related)


Would you be able to allow me temporary ssh access to see if I can figure it out? (of course, only if you machine is secure and there is no wallet.dat etc. laying around). If so, shoot me a PM and I'll take a peek



EDIT:
https://bugs.launchpad.net/ubuntu/+source/screen/+bug/574773
perhaps running this will clear things up:
Code:
sudo /etc/init.d/screen-cleanup start



RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
jondecker76 (OP)
Full Member
***
Offline Offline

Activity: 238
Merit: 100


View Profile
July 18, 2011, 06:35:31 PM
 #559

at last not a newbie Wink

JD, i must confirm, that autodonation is functioning like it should

gj = good job

Thanks for the confirmation!

RollerBot Advanced Trading Platform
https://bitcointalk.org/index.php?topic=447727.0
BTC Donations for development: 1H36oTJsi3adFh68wwzz95tPP2xoAoTmhC
Fletch
Full Member
***
Offline Offline

Activity: 168
Merit: 100


I'll have a steak sandwich and a... steak sandwich


View Profile
July 18, 2011, 08:03:52 PM
 #560

I checked the queries being executed by RunSQL and in my case it's a total of 35 queries per refresh (failover profile with 3 workers), 14 of which are queries fetching donation_start and donation_time from the settings table. Like I mentioned earlier, I have very little experience with shell scripting, but this sounds like it should be easy to optimize. It appears DonationActive is called multiple times per refresh so couldn't you just call it once at the start of smartcoin_ops.sh and store the return value in a global variable?

You could also JOIN the device table into the query at the end of GenCurrentProfile and include device.name in the result. That would mean you could get rid of the "SELECT name FROM device WHERE pk_device=$device;" query in ShowStatus. That's one less query per worker per refresh.

I'm sure there are other optimizations that could be done. This is just based on a 20 minute browse of the source code.

Regardless, the error checking you added to RunSQL seems sane. I would probably add a short delay (0.1 seconds perhaps) in the loop so that if the database is locked, it has time to be released before the next attempt is made.

Just my 2 cents.

PS. I hope you had a great time at Cedar Point. I wish there was something of that scale here in Sweden Smiley
Did you get a change to look at this? Doing just the two optimizations I mentioned above would in my case almost halve the number of queries being executed every refresh.

HashPeak - GPU mining hashrate peak detector
BTC: 1FLETCHvcUKosefrcZCLUQTtvx4WvgnYMC
Pages: « 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 [28] 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 »
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!