View unanswered posts    View active topics

All times are UTC - 6 hours





Post new topic Reply to topic  [ 32 posts ] 
Go to page Previous  1, 2, 3  Next

Print view Previous topic   Next topic  
Author Message
Search for:
 Post subject: Re: bizarre symptom
PostPosted: Fri Nov 18, 2011 9:34 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
knappster wrote:
That is definitely different than my lockups. I have hard lockups where the HDD activity completely stops, RRD stops logging, I cannot get to the login screen in ssh or mythweb. I would think you would get some sort of log on yours where something has gone awry. Maybe your /var/log/Xorg.0.log files will provide some information since your display gets goofed up when the CPU shoots up. Good luck!

I should point out that this is the first time a lockup hasn't been total (like yours), where all RRD logging stops. A total halt to everything has been the way my other lockups have gone. As far as I know this is the first half-assed lock up. ;)

How did you get your RRD graph to talk to the motherboard and CPU sensors?
tjc helped me get the RRD hard drive temperature working, but I don't have CPU or motherboard data flowing.
http://knoppmyth.net/phpBB2/viewtopic.php?t=20014&highlight=rrd

Image
Image

Good thought on the x.org log. I'll have a look right now....

I don't see any error messages. This log appears to get written during every bootup and not during the course of normal operation. Thus, it doesn't log events and give clues to what happened at a particular time.

see: http://pastebin.com/guaRMKC5

Eric

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Last edited by neutron68 on Sat Nov 19, 2011 11:16 am, edited 4 times in total.


Top
 Profile  
 
 Post subject:
PostPosted: Fri Nov 18, 2011 10:32 pm 
Offline
Joined: Thu Sep 27, 2007 5:44 pm
Posts: 336
I think my RRD picked one them up automatically... however, I'm not sure if it labels it accurately. I do get hard drive temperature as well. But the hard drive temp ramps up pretty quickly and my cpu temp varies a bit more depending on load. I am in the middle of cloning the hard drive so that I can try a different motherboard tomorrow without changing anything on the current hard drive. Once things are stable again I will let you know what my settings are. Alternatively, if you just want to read the current temps from an ssh window, I normally use mbmon I think...


Top
 Profile  
 
 Post subject:
PostPosted: Sun Nov 20, 2011 12:32 pm 
Offline
Joined: Thu Sep 27, 2007 5:44 pm
Posts: 336
This was the contents of the /usr/bin/rrd_mbtemp.pl file

Code:
#!/usr/bin/perl
#
# Modified from Martin Pot's 2003 script at
# http://martybugs.net/linux/hddtemp.cgi
#
# Graphs motherboard temperatures using mbmon
# rrd_mbtemp.pl

use RRDs;

# define location of rrdtool databases
my $rrd = '/var/lib/rrd';
# define location of images
my $img = '/data/srv/httpd/htdocs/rrd';

# process data for each temperature sensor (add/delete as required)
# run mbmon -r -c 1 to see what temps your mobo supports, then make
# an educated guess about which component corresponds to which temp
# Note TEMP0 = T 1, TEMP1 = T 2, TEMP2 = T 3
&ProcessHDD("mbtemp", "T 1", "Motherboard");
&ProcessHDD("cputemp", "T 2", "CPU");
&ProcessHDD("ambtemp", "T 3", "Case ambient");

sub ProcessHDD
{
# process HDD
# inputs: $_[0]: component being measured (used for graph filenames)
#          $_[1]: code (mbmon code)
#         $_[2]: description

        # get temp value from mbmon
        my $temp=`/usr/bin/mbmon -c 1 -$_[1]`;
        # remove eol chars and white space
        $temp =~ s/[\n ]//g;

        print "$_[2] temp: $temp degrees C\n";

        # if rrdtool database doesn't exist, create it
        if (! -e "$rrd/$_[0].rrd")
        {
                print "creating rrd database for $_[0]...\n";
                RRDs::create "$rrd/$_[0].rrd",
                        "-s 300",
                        "DS:temp:GAUGE:600:0:100",
                        "RRA:AVERAGE:0.5:1:576",
                        "RRA:AVERAGE:0.5:6:672",
                        "RRA:AVERAGE:0.5:24:732",
                        "RRA:AVERAGE:0.5:144:1460";
        }

        # insert value into rrd
        RRDs::update "$rrd/$_[0].rrd",
                "-t", "temp",
                "N:$temp";

        # create graphs
        &CreateGraph($_[0], "day", $_[2]);
        &CreateGraph($_[0], "week", $_[2]);
        &CreateGraph($_[0], "month", $_[2]);
        &CreateGraph($_[0], "year", $_[2]);
}

sub CreateGraph
{
# creates graph
# inputs: $_[0]: component being measured (used for graph filenames)
#         $_[1]: interval (ie, day, week, month, year)
#         $_[2]: description

        RRDs::graph "$img/$_[0]-$_[1].png",
                "--lazy",
                "-s -1$_[1]",
                "--color", "SHADEA#EAE9EE",
                "--color", "SHADEB#EAE9EE",
                "--color", "BACK#EAE9EE",
                "-t $_[2] Temperature - $_[1]",
                "-h", "100", "-w", "600",
                "-a", "PNG",
                "-v degrees C",
                "DEF:temp=$rrd/$_[0].rrd:temp:AVERAGE",

                # This CDEF section defines a variable for every 5 degree increment of temperature
                # between 5 and 130 degrees. Undefined/unknown variables are set to zero.
                "CDEF:tempz=temp,130,LT,temp,130,IF",
                "CDEF:tempzNoUnk=temp,UN,0,tempz,IF",
                "CDEF:tempy=temp,125,LT,temp,125,IF",
                "CDEF:tempyNoUnk=temp,UN,0,tempy,IF",
                "CDEF:tempx=temp,120,LT,temp,120,IF",
                "CDEF:tempxNoUnk=temp,UN,0,tempx,IF",
                "CDEF:tempw=temp,115,LT,temp,115,IF",
                "CDEF:tempwNoUnk=temp,UN,0,tempw,IF",
                "CDEF:tempv=temp,110,LT,temp,110,IF",
                "CDEF:tempvNoUnk=temp,UN,0,tempv,IF",
                "CDEF:tempu=temp,105,LT,temp,105,IF",
                "CDEF:tempuNoUnk=temp,UN,0,tempu,IF",
                "CDEF:tempt=temp,100,LT,temp,100,IF",
                "CDEF:temptNoUnk=temp,UN,0,tempt,IF",
                "CDEF:temps=temp,95,LT,temp,95,IF",
                "CDEF:tempsNoUnk=temp,UN,0,temps,IF",
                "CDEF:tempr=temp,90,LT,temp,90,IF",
                "CDEF:temprNoUnk=temp,UN,0,tempr,IF",
                "CDEF:tempq=temp,85,LT,temp,85,IF",
                "CDEF:tempqNoUnk=temp,UN,0,tempq,IF",
                "CDEF:tempp=temp,80,LT,temp,80,IF",
                "CDEF:temppNoUnk=temp,UN,0,tempp,IF",
                "CDEF:tempo=temp,75,LT,temp,75,IF",
                "CDEF:tempoNoUnk=temp,UN,0,tempo,IF",
                "CDEF:tempn=temp,70,LT,temp,70,IF",
                "CDEF:tempnNoUnk=temp,UN,0,tempn,IF",
                "CDEF:tempm=temp,65,LT,temp,65,IF",
                "CDEF:tempmNoUnk=temp,UN,0,tempm,IF",
                "CDEF:templ=temp,60,LT,temp,60,IF",
                "CDEF:templNoUnk=temp,UN,0,templ,IF",
                "CDEF:tempk=temp,55,LT,temp,55,IF",
                "CDEF:tempkNoUnk=temp,UN,0,tempk,IF",
                "CDEF:tempj=temp,50,LT,temp,50,IF",
                "CDEF:tempjNoUnk=temp,UN,0,tempj,IF",
                "CDEF:tempi=temp,45,LT,temp,45,IF",
                "CDEF:tempiNoUnk=temp,UN,0,tempi,IF",
                "CDEF:temph=temp,40,LT,temp,40,IF",
                "CDEF:temphNoUnk=temp,UN,0,temph,IF",
                "CDEF:tempg=temp,35,LT,temp,35,IF",
                "CDEF:tempgNoUnk=temp,UN,0,tempg,IF",
                "CDEF:tempf=temp,30,LT,temp,30,IF",
                "CDEF:tempfNoUnk=temp,UN,0,tempf,IF",
                "CDEF:tempe=temp,25,LT,temp,25,IF",
                "CDEF:tempeNoUnk=temp,UN,0,tempe,IF",
                "CDEF:tempd=temp,20,LT,temp,20,IF",
                "CDEF:tempdNoUnk=temp,UN,0,tempd,IF",
                "CDEF:tempc=temp,15,LT,temp,15,IF",
                "CDEF:tempcNoUnk=temp,UN,0,tempc,IF",
                "CDEF:tempb=temp,10,LT,temp,10,IF",
                "CDEF:tempbNoUnk=temp,UN,0,tempb,IF",
                "CDEF:tempa=temp,5,LT,temp,5,IF",
                "CDEF:tempaNoUnk=temp,UN,0,tempa,IF",

                # This section defines a color gradient for displaying the temperature data
                "AREA:tempzNoUnk#ff0000",
                "AREA:tempyNoUnk#ff0000",
                "AREA:tempxNoUnk#ff0000",
                "AREA:tempwNoUnk#ff0000",
                "AREA:tempvNoUnk#ff1b00",
                "AREA:tempuNoUnk#ff4100",
                "AREA:temptNoUnk#ff6600",
                "AREA:tempsNoUnk#ff8e00",
                "AREA:temprNoUnk#ffb500",
                "AREA:tempqNoUnk#ffdb00",
                "AREA:temppNoUnk#fdff00",
                "AREA:tempoNoUnk#d7ff00",
                "AREA:tempnNoUnk#b0ff00",
                "AREA:tempmNoUnk#8aff00",
                "AREA:templNoUnk#65ff00",
                "AREA:tempkNoUnk#3eff00",
                "AREA:tempjNoUnk#17ff00",
                "AREA:tempiNoUnk#00ff10",
                "AREA:temphNoUnk#00ff36",
                "AREA:tempgNoUnk#00ff5c",
                "AREA:tempfNoUnk#00ff83",
                "AREA:tempeNoUnk#00ffa8",
                "AREA:tempdNoUnk#00ffd0",
                "AREA:tempcNoUnk#00fff4",
                "AREA:tempbNoUnk#00e4ff",
                "AREA:tempaNoUnk#00beff",

                # Line along the top of the gradient graph
                "LINE2:temp#0000FF:$_[2] ($_[0])",

                "GPRINT:temp:MIN:  Min\\: %2.lf",
                "GPRINT:temp:MAX: Max\\: %2.lf",
                "GPRINT:temp:AVERAGE: Avg\\: %4.1lf",
                "GPRINT:temp:LAST: Current\\: %2.lf degrees C\\n";
        if ($ERROR = RRDs::error) { print "$0: unable to generate $_[0] graph: $ERROR\n"; }
}


Unfortunately, I have replaced the motherboard with the Abit AN-M2 now and mbmon does not work, so I have just modified the file to replace:

Code:
        my $temp=`/usr/bin/mbmon -c 1 -$_[1]`;


with

Code:
        my $temp=`sensors | grep -m 1 -i "Core0 Temp" | cut -c 15-18`;


I don't think my sensors output is accurate and it seems to bounce around a lot, but that will at least provide one of the temperatures so that I can monitor it and go from there.


Top
 Profile  
 
 Post subject: sensors program
PostPosted: Sun Nov 20, 2011 2:15 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
I did a little searching and it looks I also need to use the "sensors" program to get my data from the Asus motherboard I'm using.

Thus far, all I can tell you is that if I type "sensors" I get this:
Code:
root@mythtv:~# sensors
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp:  +22.0 C
Core1 Temp:  +33.0 C

So I only have 2 sensors available??

The /etc/sensors.conf file has these lines in it.
Code:
chip "k8temp-*"

   label temp1 "Core0 Temp"
   label temp2 "Core0 Temp"
   label temp3 "Core1 Temp"
   label temp4 "Core1 Temp"

What does your /etc/rrd.conf file look like?

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
 Post subject: Re: sensors program
PostPosted: Sun Nov 20, 2011 5:58 pm 
Offline
Joined: Thu Sep 27, 2007 5:44 pm
Posts: 336
neutron68 wrote:
I did a little searching and it looks I also need to use the "sensors" program to get my data from the Asus motherboard I'm using.

Thus far, all I can tell you is that if I type "sensors" I get this:
Code:
root@mythtv:~# sensors
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp:  +22.0 C
Core1 Temp:  +33.0 C

So I only have 2 sensors available??

The /etc/sensors.conf file has these lines in it.
Code:
chip "k8temp-*"

   label temp1 "Core0 Temp"
   label temp2 "Core0 Temp"
   label temp3 "Core1 Temp"
   label temp4 "Core1 Temp"

What does your /etc/rrd.conf file look like?


My /etc/sensors3.conf file has the same k8temp-* info.
With R7.1 I do not have a /etc/rrd.conf, instead I have /etc/rrdsettings.cfg which contains:

Code:
# use the following format
# DISKDEV=full path to device : label for graph (no spaces)

DISKDEV=/dev/sda1:system
DISKDEV=/dev/sda3:myth
#DISKDEV=//192.168.1.5/Volume_1/Video:nas


my sensors reports:
Code:
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp:  +24.0°C
Core0 Temp:  +22.0°C
Core1 Temp:  +23.0°C
Core1 Temp:  +18.0°C

acpitz-virtual-0
Adapter: Virtual device
temp1:       +32.0°C  (crit = +90.0°C)

The acpitz-virtual-0 appears to be my hdd temp. I'm not sure what to make of my other 4 temps, so I just decided to record the first one.


Top
 Profile  
 
 Post subject: trying to decipher
PostPosted: Sun Nov 20, 2011 6:42 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
I've been looking at this thread - almost exact same situation as mine.
http://knoppmyth.net/phpBB2/viewtopic.php?t=16546&postdays=0&postorder=asc&start=0
They also used the 'sensors' program.

There are 3 files that seem to need to be in sync and I can't tell how they all tie together.
It looks like one file refers to another and the variables in one need to be referenced in another.
1. /etc/sensors.conf
2. /etc/rrd.config
3. /usr/local/bin/rrd_MBtemp.pl

Given that the Core0 and Core1 temperatures are drastically different, I don't think they are the two cores of my dual core CPU:
Code:
root@mythtv:~# sensors
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp:  +23.0 C
Core1 Temp:  +35.0 C
I think that Core0 may be the motherboard temp sensor and Core1 is the CPU temperature sensor.

My /etc/rrd.config file looks like this. I edited the "Motherboard / CPU temperature" section to try and match what they were doing in the thread, but I don't think I have it right.
Code:
#!/usr/bin/perl
#
#  Configuration file for the KnoppMyth rrd data collection routines.
#
########################################################################
# Global Configuration - config used by all the rrd_XXX.pl scripts:
#=======================================================================
$log = '/myth/rrd/log';      # location of the database (log) files
$png = '/myth/rrd/png';      # location of graph (png) files
#-----------------------------------------------------------------------
$color = '#EAE9EE';      # background color used in the graphs
$Gwd = 600;         # graph width in pixels
$Ght = 100;         # graph height in pixels
########################################################################
# Individual Configuration for each collection type:
#=======================================================================
#  rrd_CPU.pl configuration:   # cpu utilization
#-----------------------------------------------------------------------
$color_cpuS = '#FF0000';   # color of 'System' area of the graph
$color_cpuU = '#0000FF';   # color of 'User' area of the graph
$color_cpuN = '#00FF00';   # color of 'Nice' area of the graph
$color_cpuI = '#EEEE00';   # color of 'Idle' area of the graph
#=======================================================================
#  rrd_Mem.pl configuration:   # memory utilization
#-----------------------------------------------------------------------
$color_memU = '#0000FF';   # color of 'Used' area of the graph
$color_memB = '#FF00FF';   # color of 'Buffered' area of the graph
$color_memC = '#FFFF00';   # color of 'Cached' area of the graph
$color_memF = '#00FF00';   # color of 'Free' area of the graph
#=======================================================================
#  rrd_Swap.pl configuration:   # swap utilization
#-----------------------------------------------------------------------
$color_swpU = '#0000FF';   # color of 'Used' area of the graph
$color_swpF = '#00FF00';   # color of 'Free' area of the graph
#=======================================================================
#  rrd_Net.pl configuration:   # network utilization
#-----------------------------------------------------------------------
$color_netO = '#FFFF00';   # color of 'Outgoing' area of the graph
$color_netT = '#888800';   # color of 'Outgoing' border
$color_netI = '#00FF00';   # color of 'Incoming' area of the graph
$color_netB = '#008800';   # color of 'Incoming' border
#=======================================================================
#  rrd_Disk.pl configuration:   # disk reads/writes
#-----------------------------------------------------------------------
$color_dskR = '#0000FF';   # color of 'Read' line of the graph
$color_dskW = '#FF0000';   # color of 'Write' line of the graph
#=======================================================================
#  rrd_MythFS.pl configuration:   # mount point space/inodes utilization
#-----------------------------------------------------------------------
$mnt_dir = '/myth';      # mount point to check
$color_mntS = '#0000FF';   # color of 'Space' line of the graph
$color_mntI = '#008800';   # color of 'Inodes' line of the graph
#=======================================================================
#  rrd_Count.pl configuration:   # count of files in a directory
#-----------------------------------------------------------------------
$CNT_dir = '/myth/tv';      # directory in which to count
$CNT_ex1 = '*.mpg';      # glob expression for first count
$CNT_ex2 = '*.nuv';      # glob expression for second count
$color_cnt1 = '#0000FF';   # color of 'First' count graph line
$color_cnt2 = '#008800';   # color of 'Second' count graph line
$CNT_prog = 'rrd_file_count';   # program to do the actual counting
#=======================================================================
#  rrd_Load.pl configuration:   # 1, 5, 15 minute load averages
#-----------------------------------------------------------------------
$color_avg1 = '#FF0000';   # color of '1' minute graph line
$color_avg2 = '#00FF00';   # color of '5' minute graph line
$color_avg3 = '#0000FF';   # color of '15' minute graph line
#=======================================================================
#  rrd_HDtemp.pl config:   # hard disk temperature
#-----------------------------------------------------------------------
$HDT_dev = 'sda';             # devices to measure
@HDT_colors = ('#0000FF', '#00FF00', '#FF0000', '#00FFFF', '#FF00FF', '#FFFF00');
#=======================================================================
#  rrd_MBtemp.pl configuration:   # Motherboard / CPU temperature
#-----------------------------------------------------------------------
$color_mbt1 = '#00FF00';   # color of 'motherboard' graph line
$color_mbt2 = '#FF0000';   # color of 'CPU1' graph line
$color_mbt3 = '#0000FF';   # color of 'CPU2' graph line
$MBT_mobo   = 'MB';             #
$MBT_core1   = 'CPU1';           #
$MBT_core2   = 'CPU2';           #
$MBT_prog = 'sensors';      # program & args to gather the data
#=======================================================================
#  rrd_MBfan.pl configuration:   # Motherboard / CPU fan sppeds
#-----------------------------------------------------------------------
$color_mbf1 = '#00FF00';   # color of 'motherboard' graph line
$color_mbf2 = '#FF0000';   # color of 'CPU' graph line
$color_mbf3 = '#0000FF';   # color of case 'Ambient' graph line
$MBF_mobo   = '3';              #
$MBF_cpu    = '2';              #
$MBF_case   = '1';              #
$MBF_prog = 'mbmon -c 1 -F';   # program & args to gather the data
########################################################################
# vim: sw=4 ts=8
# End


I have tried to alter the MBTemp scirpt to match what they were showing in the referenced thread, but it's not quite right either. I'm getting confused on the many variables they are using.
Code:
#!/usr/bin/perl
#
#  rrd_MBtemp.pl
#   Motherboard Temperature data collection routine for KnoppMyth
#
########################################################################
# Configuration:
my $dbf = 'MBtemp';
my $configfile = '/etc/rrd.config';
########################################################################
use RRDs;

if (! -d "/myth") { $configfile = "./D_rrd.config"; }   # DEBUG
do $configfile;

sub create {
    #   $_[0] = filename
    if (! -e "$log/$_[0].rrd") {
   print "Create db for $_[0] => $log/$_[0].rrd\n";
   RRDs::create( "$log/$_[0].rrd", "-s 300",
       "DS:mbt1:GAUGE:600:0:U",
       "DS:mbt2:GAUGE:600:0:U",
       "DS:mbt3:GAUGE:600:0:U",
       "RRA:AVERAGE:0.5:1:576",
       "RRA:AVERAGE:0.5:6:672",
       "RRA:AVERAGE:0.5:24:732",
       "RRA:AVERAGE:0.5:144:1460");
   $ERROR = RRDs::error;
   print "Error: RRDs::create failed for '$_[0]' : $ERROR\n" if $ERROR;
    }
}

my ($mbt1, $mbt2, $mbt3);

sub gather {
    $mbt1 = `$MBT_prog | grep "M/B Temp" | cut -c 15-17`;
    $mbt2 = `$MBT_prog | grep "Core0 Temp" | cut -c 15-18`;
    $mbt3 = `$MBT_prog | grep "Core1 Temp" | cut -c 15-18`;
    if ($MBT_mobo ne "") {
        $mbt1 = `$MBT_prog $MBT_mobo`;
    }
    if ($MBT_core1 ne "") {
        $mbt2 = `$MBT_prog $MBT_core1`;
    }
    if ($MBT_core2 ne "") {
        $mbt3 = `$MBT_prog $MBT_core2`;
    }
    $mbt1 =~ s/[\n ]//g;
    $mbt2 =~ s/[\n ]//g;
    $mbt3 =~ s/[\n ]//g;
    print "$dbf: motherboard $mbt1, CPU1 $mbt2, CPU2 $mbt3, °C\n";

}

sub update {
    #   $_[0] = filename
    RRDs::update( "$log/$_[0].rrd", "-t",
   "mbt1:mbt2:mbt3",
   "N:$mbt1:$mbt2:$mbt3");
    $ERROR = RRDs::error;
    print "Error: RRDs::update for '$_[0]' : $ERROR\n" if $ERROR;
}


sub graph {
    #   $_[0] = time interval (ie: day...)
    #   $_[1] = filename suffix.
    undef @args;
    @args = ( "$png/$dbf-$_[1].png", "-s -1$_[0]", "-aPNG",
        "-w $Gwd", "-h $Ght", "-E", "-l 20", "-M",
        "--color", "SHADEA$color",
        "--color", "SHADEB$color",
        "--color", "BACK$color",
        "-t Motherboard & CPU temperature degrees C :: $_[0]");
    if ($MBT_mobo ne "") {
        push @args, "DEF:mbt1=$log/$dbf.rrd:mbt1:AVERAGE",
        "LINE1:mbt1$color_mbt1: MB temp\\:",
   "GPRINT:mbt1:MIN:Minimum\\: % 5.1lf",
   "GPRINT:mbt1:MAX:Maximum\\: % 5.1lf",
   "GPRINT:mbt1:AVERAGE:Average\\: % 5.1lf",
   "GPRINT:mbt1:LAST:Current\\: % 5.1lf °C\\j";
    }
    if ($MBT_core1 ne "") {
        push @args, "DEF:mbt2=$log/$dbf.rrd:mbt2:AVERAGE",
   "LINE1:mbt2$color_mbt2:CPU1 temp\\:",
   "GPRINT:mbt2:MIN:Minimum\\: % 5.1lf",
   "GPRINT:mbt2:MAX:Maximum\\: % 5.1lf",
   "GPRINT:mbt2:AVERAGE:Average\\: % 5.1lf",
   "GPRINT:mbt2:LAST:Current\\: % 5.1lf °C\\j";
    }
    if ($MBT_core2 ne "") {
        push @args, "DEF:mbt3=$log/$dbf.rrd:mbt3:AVERAGE",
   "LINE1:mbt3$color_mbt3:CPU2 temp\\:",
   "GPRINT:mbt3:MIN:Minimum\\: % 5.1lf",
   "GPRINT:mbt3:MAX:Maximum\\: % 5.1lf",
   "GPRINT:mbt3:AVERAGE:Average\\: % 5.1lf",
   "GPRINT:mbt3:LAST:Current\\: % 5.1lf °C\\j";
    }
    RRDs::graph(@args);
    $ERROR = RRDs::error;
    print "Error: RRDs::graph failed for '$_[0]' : $ERROR\n" if $ERROR;
}
########################################################################
create "$dbf";
gather;
update "$dbf";
graph 'day',   'Daily';
graph 'week',  'Weekly';
graph 'month', 'Monthly';
graph 'year',  'Yearly';
########################################################################
# vim: sw=4 ts=8
# End


Focus on the "sub gather { " section.
Can you tell where my variable mismatches are?

Eric

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
 Post subject:
PostPosted: Mon Nov 21, 2011 2:01 pm 
Offline
Joined: Thu Mar 25, 2004 11:00 am
Posts: 9551
Location: Arlington, MA
You may need to run the sensors configuration wizard sensors-detect as root. I was just skimming but didn't see it mentioned anywhere in the thread.

The motherboard in my current production system didn't work with mbmon so I had to install the sensors package and convert the rrd scripts to use that too.


Top
 Profile  
 
 Post subject:
PostPosted: Mon Nov 21, 2011 2:48 pm 
Offline
Joined: Thu Sep 27, 2007 5:44 pm
Posts: 336
Code:
    $mbt1 = `$MBT_prog | grep "M/B Temp" | cut -c 15-17`;
    $mbt2 = `$MBT_prog | grep "Core0 Temp" | cut -c 15-18`;
    $mbt3 = `$MBT_prog | grep "Core1 Temp" | cut -c 15-18`;

You do not have an "M/B Temp" so I don't know how the script will respond to this empty set for $mbt1.

What do you get if you type
Code:
sensors | grep "M/B Temp" | cut -c 15-18
in an xterm window?

Similarly for:
Code:
sensors | grep "Core0 Temp" | cut -c 15-17
and
Code:
sensors | grep "Core1 Temp" | cut -c 15-18


Maybe you can try commenting out the line for $mbt1 and replace it with
Code:
$mbt1 = 0;


I'm not sure about the rest because your rrd seems to be quite a bit older than mine.


Top
 Profile  
 
 Post subject: I'll start an RRD thread
PostPosted: Sun Nov 27, 2011 11:43 am 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
I guess this RRD stuff isn't directly related to the lockup diagnosis.

I'll start a new thread for it:

see:
http://knoppmyth.net/phpBB2/viewtopic.p ... 535#133535

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Last edited by neutron68 on Sun Nov 27, 2011 8:47 pm, edited 1 time in total.


Top
 Profile  
 
 Post subject: swapped motherboard
PostPosted: Sun Nov 27, 2011 8:36 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
I just swapped out the motherboard, along with CPU and 2GB of RAM, with a clone (exact same brand and model).

We'll see if the lockups stop or not.
The only original things left in that box are the 2 tuner cards and the Knoppmyth R5.5 OS on the harddrive.

Eri

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
PostPosted: Thu Dec 01, 2011 8:29 am 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
It's now 3.5 days since I swapped the motherboard with my spare...no lockups yet! :D

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
 Post subject: 6 days and no lockups
PostPosted: Sat Dec 03, 2011 3:30 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
It's now been 6 days since the MB swap...no lockups.

Eric

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
 Post subject: 2 weeks and no lockups
PostPosted: Sun Dec 11, 2011 12:10 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
It's now been 2 weeks since I swapped the MB, CPU and RAM...still no lockups.

I'm going to call this one solved. 8)

Eric

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 
 Post subject:
PostPosted: Wed Dec 28, 2011 5:51 pm 
Offline
Joined: Thu Sep 27, 2007 5:44 pm
Posts: 336
knappster wrote:
slowtolearn wrote:
My first thought when encountering mysterious lockups is heat. I have made it a habit to blow out each of my systems with a compressor (careful, make sure you have a moisture filter on it, or use those cans of compressed air from the office supply stores) every 6 months to clear the dust-bunnies out of the PSU, all fans and the CPU heatsink. Have not had issues since instituting this schedule.

@neutron68: Check to make sure all fans are spinning at a reasonable speed. Check your temps with sensors, might want to setup a script that runs this every so often, placing the output to a file so you can review it after a lockup.

@knappster: You could be right about the Northbridge on the M2NPV-VM. The stock heatsink on my M2NPV-VMs weren't making very good contact due to the thermal paste they used at the factory, I removed each of them and replaced the paste. No problems with any of my 4 boards in years since I did that. I also had one board that would freak out in different ways with no apparent consistency, since upgrading the BIOS on that one it has been well-behaved (according to dmidecode that one is running BIOS rev 1401)



Thanks for the response. It's probably a good idea to try to re-apply thermal paste on the northbridge and reseat the heatsink. I thought it looked crooked on mine actually, and I thought about reseating it when I noticed, but it did not seem like it wanted to come off and I think I saw pictures of it online where it looked crooked as well, so I assumed it was just how it was designed. Is there any trick to removing it? I have read that most/all aftermarket northbridge heatsinks will not fit on it as well, so we're stuck with the dinky one that comes with it. I have BIOS 1401 as well. I wouldn't be so hesitant to start over except for all of the little things I have had to do to get everything working as it should.

I also discovered that I have an Abit AN-M2 in my closet that they sent me as a warranty replacement for a different board a couple years ago. I have never tried to even power it on yet, so I may try moving everything to it and praying that it just works without reinstalling. The only disadvantage is that it has no serial port, or even a serial port header. I would have to buy a PCI serial port in order to use my homebrew IR blaster :(


I replaced the M2NPV-VM with the Abit AN-M2 motherboard, did a fresh install of R7, got everything configured, then the motherboard died 2 days later. I re-seated the Northbridge Heatsink on the M2NPV-VM with Arctic Silver 3, did a fresh install of R7 (again) and it has been running for over a month now without any lockups (knock on wood).

I just thought it was worth mentioning in the event anyone else was having the same problem.


Top
 Profile  
 
 Post subject: motherboard died?
PostPosted: Wed Dec 28, 2011 10:04 pm 
Offline
Joined: Tue Mar 28, 2006 8:26 pm
Posts: 804
Location: Minneapolis, MN
knappster wrote:
I replaced the M2NPV-VM with the Abit AN-M2 motherboard, did a fresh install of R7, got everything configured, then the motherboard died 2 days later. I re-seated the Northbridge Heatsink on the M2NPV-VM with Arctic Silver 3, did a fresh install of R7 (again) and it has been running for over a month now without any lockups (knock on wood).

I just thought it was worth mentioning in the event anyone else was having the same problem.

I'm glad you are running without lockups now! I know it was really bothering me!

When you said "the motherboard died", did you mean the motherboard stopped working for good and you replaced it or, that it locked up and you then worked on the Northbridge heatsink?

Eric

_________________
KnoppMyth R5.5, Asus A8N-VM CSM (nvidia 6150 onboard video), AMD Athlon 64 dual-core 4200+, two 1GB sticks DDR 400, HD-3000 HDTV card, PVR-150 card, Iguanaworks RS-232 IR receiver/transmitter, Pioneer DVR-110 DVD burner


Top
 Profile  
 

Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 32 posts ] 
Go to page Previous  1, 2, 3  Next



All times are UTC - 6 hours




Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Jump to:  
cron
Powered by phpBB® Forum Software © phpBB Group

Theme Created By ceyhansuyu