|
Grex > Helpers > #145: Upgrading Grex to 3.8 - a postmortem | |
|
| Author |
Message |
| 25 new of 62 responses total. |
eprom
|
|
response 25 of 62:
|
Nov 20 19:32 UTC 2005 |
Thanks ya'll.
|
scholar
|
|
response 26 of 62:
|
Nov 20 19:44 UTC 2005 |
Re. 23: Thanks, Bruce!
Re. 24: I firmly believe other's foreskin ascended to the heavens and now
forms the rings of Saturn.
|
cross
|
|
response 27 of 62:
|
Nov 20 19:49 UTC 2005 |
This response has been erased.
|
scholar
|
|
response 28 of 62:
|
Nov 20 19:55 UTC 2005 |
I'm a bit upset that md ignored me crying on his shoulder.
Dear Mr. Expert Witness,
I would be willing to hire you for a small cash fee to testify in any legal
cases I may or may not bring against Mr. ANdre.
I wonder if that will get his attention.
|
steve
|
|
response 29 of 62:
|
Nov 20 20:20 UTC 2005 |
You just try that, 'scholar'.
|
scholar
|
|
response 30 of 62:
|
Nov 20 20:22 UTC 2005 |
Okay!
I will indeed try to either bring a legal case against you or not bring a
legal case against you.
No joke.
|
steve
|
|
response 31 of 62:
|
Nov 20 22:11 UTC 2005 |
Grex is now running with 512M of ram. There are three DIMM
slots on the motherboard. I coudn't see any markings for which
was slot 0 (of 0-2) so guessed that they'd be numbered left to
right. That turned out to be right. I took all three out and
put the second DIMM (slot 1) into slot 0. Given that this is
all a crap shoot I figured that moving everything around was
the best idea.
If Grex becomes stable we still don't know where the
problem lies, exactly. It could be the ram. I plan on
calling Crucial tomrorow and see about sending the two
unused DIMMs to them for testing. While I believe the
memtest86 program results, Crucial has the best hardware
to test ram.
Grex has been running for almost an hour now with
a couple of programs comiling in the background. We'll
see if it continues...
|
rcurl
|
|
response 32 of 62:
|
Nov 20 22:37 UTC 2005 |
"a value that had been written to memory by the CPU, that differed in one
bit when it was read back"
I had that problem back in 1958 or so on a Datatron.
|
steve
|
|
response 33 of 62:
|
Nov 20 22:38 UTC 2005 |
We've now been running for 45 minutes running normal
things, plus two infinate loops of compiling a 100,000
line C program. These last two items are causing us to
swap. Grex is definitely busy at the moment. I'm going
to let this run as long as I can.
|
steve
|
|
response 34 of 62:
|
Nov 21 01:02 UTC 2005 |
After 20 or so minutes of a few programs compiling, I bought it
up to 10, started swaping like mad and got the load average up to
68 for several minutes. Grex didn't crash after more than 30 minutes
of this.
|
cross
|
|
response 35 of 62:
|
Nov 21 01:20 UTC 2005 |
This response has been erased.
|
bhoward
|
|
response 36 of 62:
|
Nov 21 01:24 UTC 2005 |
Dan, as far as I know, no one has looked into recompiling grexsoft
or anything else outside of the grexdoc structure bar possibly a
few things manually installed from the ports tree.
We're flying into Annapolis early tomorrow but I may be able to
take a look into it once things have settled down later this week.
|
bhoward
|
|
response 37 of 62:
|
Nov 21 01:28 UTC 2005 |
(35 slipped in)
It looks like it was a controlled reboot and Steve logged back in
right after the system came up. I suspect it was part of the system
memory swapping and testing he's doing this evening.
|
steve
|
|
response 38 of 62:
|
Nov 21 01:28 UTC 2005 |
Grex rebooted 32 minutes ago because I had to bring it down in order
to put it back in its little home. Grex lives in the Attic. I gotta
get some pictures of this place. It definitely looks like an Attic.
Grex lives in the bottom of an empty 19in rack mount. You can't stare
at the hardware when its there, so it has to be brought out and hooked
up on a table next to the rack.
The test I ran earlier tonight put real strain on the system. I've
never seen an OpenBSD system with a load average of 68. I kept the
system stressed for at least 30 minutes, to see what would happen.
I think we're in better shape now.
|
naftee
|
|
response 39 of 62:
|
Nov 21 04:04 UTC 2005 |
steVE's wife is glandular.
|
naftee
|
|
response 40 of 62:
|
Nov 21 04:07 UTC 2005 |
"glendular" is more funny.
|
scholar
|
|
response 41 of 62:
|
Nov 21 05:46 UTC 2005 |
I agree.
|
photeus
|
|
response 42 of 62:
|
Nov 21 17:01 UTC 2005 |
If the issue is a memory issue a standard diagnostic may overlook it.
I recommend a program called "tufftest pro" It has it's own mini OS.
It is able to relocate it's OS into a diffrent memory locatiojn therby
allowing you to do a comprehensive test on all memory. http://tufftestpro.c
om
|
jep
|
|
response 43 of 62:
|
Nov 21 17:06 UTC 2005 |
Thanks, staff.
|
steve
|
|
response 44 of 62:
|
Nov 21 17:35 UTC 2005 |
I've never seen memtest86 not spot bad memory. I have once seen false
positives, but not negatives. If the two DIMMs I have have a problem
this will be the first time memtest has failed me.
|
tod
|
|
response 45 of 62:
|
Nov 21 17:45 UTC 2005 |
Did you try a white eraser on the contacts of the original SIMMS?
|
eprom
|
|
response 46 of 62:
|
Nov 21 18:06 UTC 2005 |
a pink erase is just as good.
|
albaugh
|
|
response 47 of 62:
|
Nov 21 18:12 UTC 2005 |
So is our mail before the wondrous upgrade gone forever and anon, with no hope
of restoration? I suspect the answer is "yes, so sorry", in which case that
becomes "the final straw".
|
nharmon
|
|
response 48 of 62:
|
Nov 21 18:15 UTC 2005 |
You gotta blow on the DIMMs and jiggle them like the old Nintendo
"tapes" I had as a kid.
> So is our mail before the wondrous upgrade gone forever and anon, with
> no hope of restoration?
Kevin, I am sure you would be more than welcome have the old drive
analyzed and the data restored. Of course, that would be at your own
expense, and you would probably need to lend a replacement to Grex for
the interim.
|
steve
|
|
response 49 of 62:
|
Nov 21 18:33 UTC 2005 |
I'm sorry Kevin, the data is gone. The data isn't recoverable,
because the disk is in use now. There isn't any way to recover
it. I thought I'd made a complete backup of the /var partition,
which I did not. I lost mail too.
|