libera/#devuan/ Monday, 2024-02-05

gnarfacecousin_luigi: the system i found that bios bug in with was also AMD03:41
gnarfaces/with//03:41
gnarfaceone thing i discovered that was a very reliable way to reproduce the problem was to scp a very large file to it03:43
gnarfaceit would sometimes be fine for weeks if idle, and without any sustained disk/io, any amount of cpu load was fine03:44
gnarfaceif you can take a summary of which power management features are provided by your bios, i may be able to sanity check them for you03:45
gnarfacesome of these old AMD motherboard bioses in the wild, even in their most recent versions available, are still floating around out there with some nasty bugs03:46
gnarface(the other "wontfix" bug i've discovered proliferating seems to be an issue just with the samsung evo SSDs)03:47
cousin_luigignarface: How do I do that?07:23
cousin_luigiI mean, is there a tool or you mean from the post menu??07:25
gnarfacecousin_luigi: i meant from the post menu, sorry07:27
cousin_luiginm, will try, but it's a rather simple thing07:29
gnarfaceyea, sometimes they just don't give you the options you need, it's a pity07:30
gnarfaceDell BIOS used to be like that, though i admit i haven't seen one lately07:30
gnarfaceASUS seems to ship a lot of these bugs too, but at least they give you a way to shut off the broken features07:31
gnarfaceif there's some "advanced" mode try to find it07:34
cousin_luigignarface: https://imgur.com/a/OH7FFCs07:36
cousin_luigi(note, I'm not sure those are the current settings, that's a video I took months ago)07:37
gnarfacecousin_luigi: wow, is that all? i see what you mean...07:53
cousin_luigignarface: It's a thin client, not a regular motherboard07:54
cousin_luigiWill also run memtest86+ later.07:54
gnarfacecousin_luigi: i do wonder if the super vague "Idle power Savings" = "Extended" could be a vague simplification of several other controls... i'd be curious what the other options were besides "extended" and maybe try one of the other settings07:55
gnarfacelike for example if there were 5 settings and "Extended" was the maximum setting, i'd try the one immediately below that first07:56
gnarfaceand as for the S5 thing... i had no idea it even went up to S5, that one's new to me, but i see you have it disabled already07:56
gnarfaceif you have a way to check the power usage at idle and there's a change that clearly saves a static amount of watts but only literally just about 2-3 of them, i'd say that'd be a prime suspect07:57
gnarfacewhen i was initially troubleshooting this, i was told by several people that it was a sign the CPU was going bad, not the RAM, but after changing the CPU twice i started getting more pedantic about my testing regime, and managed to deductively isolate that "C1E Powersaving" feature toggle as the culprit07:59
gnarfacebut with comparatively less granular controls, all you can really do is the same process and hope one of the batches of features these options you have includes it08:00
gnarfacebatches of several at once that these options are masquerading for, really, i suspect08:01
cousin_luigi gnarface: I don't recall any further options, also see last picture I added08:02
gnarface(though i'm not sure, i assume the "Runtime Power Management" thing is just whether to allow the kernel to change the cpu clockspeed through the regular cpufreq modules or not)08:02
gnarface(and that one you'll definitely want)08:02
cousin_luigignarface: The only thing I could do was changing the governor, which I set to powersave08:02
cousin_luigiThe rest is internally managed, apparently08:03
gnarfaceif it's the same thing i was running into, the actual clock speed or governor you use won't end up mattering much, what will really matter is sustained disk io08:03
gnarfacenot that it can't happen on a relatively idle system, but sustained disk io was the one thing i found to reliably increase the chances of it happening08:04
cousin_luigignarface: The device is used as a router, with root mounted read only.08:04
gnarfacehmm, well for what it's worth temperature sensor polling seemed to be equivalent to disk io for the purposes of triggering the problem08:05
cousin_luigitemperature you say hmmm08:05
gnarfaceyea, the first mitigating factor was i took temperature polling out of the munin runs' task set08:06
cousin_luigi"munin"?08:06
gnarfacejust generic system monitoring software08:06
cousin_luigiI see. I reduced the idle fan speed to the minimum, to see if I could reduce the noise.08:07
cousin_luigiBut I'm not 100% sure it didn't happen before.08:07
gnarfaceanyway, that dropped the frequency of the occurrences a lot, it wasn't until later i discovered the much more reliable triggering use case of scping a big disk image up08:07
cousin_luigiI see. Will try one parameter at a time. Yesterday I removed the NIC firmware, since the tg3 module appears to work correctly even without it. And at the same speed.08:08
gnarfacewithin a few gigabytes of transfer that would cause the problem 100%08:08
gnarfacebut it had to be going onto disk, it didn't seem to matter if it was just passing through08:08
gnarfacei'm not sure what analogues your system might have to that basic type of traffic that might be triggering this but it seems possible there's something08:09
gnarfacecousin_luigi: by the way, if the fan has 3 or more wires going into it, i'd strongly recommend letting the bios handle it as generously as possible, but i do sympathize with wanting to make things quieter... i doubt that's the cause of the issue here either way08:12
gnarfacecousin_luigi: if this is an amd64 cpu in this device, i would strongly recommend loading the amd64-microcode though08:13
gnarfaceamd64-microcode package, i meant08:13
gnarfacecousin_luigi: oh and to be absolutely clear, temperatures didn't affect it, what affected it was how often the motherboard sensors were polled08:20
gnarfacewhy just that and disk io, i could only speculate08:20
cousin_luigignarface: Hmm, not loaded by default on devuan.09:15
cousin_luigiAnd memtest86+ had a successful pass.09:15
cousin_luigignarface: So, you would recommend removing blacklist microcode?09:16
cousin_luigiHow do I check if the cpu microcode has been loaded?09:48
cousin_luigioh, it's in dmesg09:49
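[Editor's note] Besides dmesg, x86 kernels also report the running microcode revision in the `microcode` field of /proc/cpuinfo. A minimal sketch of reading it; the function name `microcode_revisions` is made up for illustration, and the optional file argument exists only so it can be tried on a saved copy of cpuinfo.

```shell
# Print the distinct microcode revisions reported across all cores.
# Hypothetical helper: $1 defaults to /proc/cpuinfo.
microcode_revisions() {
    # Each core has a line like "microcode<TAB>: 0x...", keep only the value.
    awk -F': *' '/^microcode/ { print $2 }' "${1:-/proc/cpuinfo}" | sort -u
}
```

Comparing the output of `microcode_revisions` before and after installing amd64-microcode and rebooting should show whether the initrd applied a newer revision; `dmesg | grep -i microcode` shows the kernel's own messages about it.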
cousin_luigihmm, do I really need to unblacklist the ucode if it's already in the initrd?09:55
gnarfacecousin_luigi: blacklist? not sure what you're talking about there. afaik all i've ever had to do was install the package...10:23
gnarfaceit isn't installed by default because it's in non-free (non-free-firmware, as of daedalus) but still sometimes carries important security and stability fixes10:24
gnarfacei can't be sure that's relevant to your issue here but it's a good idea to have it either way10:25
gnarfacesince we're not sure what else you might be missing if you were missing that, check to make sure you have acpid installed... it's critical for functional power management in varying degrees depending on the hardware10:27
cousin_luigignarface: Please check /etc/modprobe.d/amd64-microcode-blacklist.conf10:28
cousin_luigiThat comes by default with amd64-microcode10:28
cousin_luigihmm, acpid is not installed10:29
gnarfacecousin_luigi: oh, i see. yea i have that too, i hadn't realized that was there... that's interesting.... maybe you're right though and the initrd does it so this one doesn't have to...10:30
cousin_luigignarface: Meanwhile, the damn thing crapped out on me again, removed the governor thing to see if it makes a difference10:30
cousin_luiginow, one can see the microcode version in kern.log, but how does one make sure it's the latest one?10:32
gnarfacecousin_luigi: ah, check /usr/share/doc/amd64-microcode/README.Debian.gz10:32
cousin_luigi(I thought I did)10:32
gnarfacecousin_luigi: it says with the default debian kernel it should be enough to just install the package10:32
gnarfaceand it is doing it from the initrd, where it's safer, that's what i'm gathering from this10:32
cousin_luigiindeed, that was my conclusion as well10:33
gnarfacebut it also says you can trigger a manual immediate microcode update if you want with: echo 1 > /sys/devices/system/cpu/microcode/reload10:33
gnarfacebut it says specifically, "DO THIS ONLY WHEN YOU KNOW BETTER"10:33
cousin_luigiI get a permission denied10:33
gnarfaceit says you need to be root10:34
cousin_luigiwhich I am10:34
gnarfacehmm10:34
cousin_luigiperhaps I need to load that module first?10:34
gnarfaceno, i wouldn't10:34
gnarfacei would just assume it's working TBH until i talked to someone smarter10:34
cousin_luigianyway, it's not a big deal since I've rebooted a number of things10:34
cousin_luigitimes*10:34
gnarfacewhen you said it died just now, was that with acpid running?10:35
cousin_luigino10:35
cousin_luigiacpid wasn't even instaleld10:35
cousin_luigiinstalled*10:35
cousin_luigiWill have to wait a bit, I suppose.10:36
gnarfacei've only seen it actually cause stability problems on nvidia video cards when missing, but we're just crossing stuff off the list to make the list shorter here10:36
gnarfacepowersave was probably the most reliable cpufreq governor but it might be worth trying ondemand and schedutil just to see if it changes anything about the randomness, that could be a clue10:37
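[Editor's note] The governor currently in use is exposed per CPU under sysfs. A sketch for listing it, assuming the standard cpufreq sysfs layout; `show_governors` is a hypothetical name, and the optional argument only exists so the function can be pointed at a test directory.

```shell
# Print "cpuN: governor" for every CPU that exposes cpufreq.
# Hypothetical helper: $1 defaults to the real sysfs CPU directory.
show_governors() (
    root="${1:-/sys/devices/system/cpu}"
    for f in "$root"/cpu[0-9]*/cpufreq/scaling_governor; do
        [ -r "$f" ] || continue      # skip if cpufreq is not available
        cpu=${f#"$root"/}            # strip the leading directory
        cpu=${cpu%%/*}               # keep only "cpuN"
        printf '%s: %s\n' "$cpu" "$(cat "$f")"
    done
)
```

The governors a driver offers are listed in the neighbouring `scaling_available_governors` file, and writing a name such as `ondemand` into `scaling_governor` as root switches that CPU until the next reboot.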
cousin_luigiBy the way, would you force amdgpu when the default is radeon? I returned that to default too. The card is not in use anyway.10:39
gnarfacei would decide by the card generation i have, some can do both and in that range, some are better at one than the other10:39
gnarfaceoff the top of my head i'm not super clear on the ones closer to the transition point10:40
gnarfaceanything after RX 480 i'd definitely go amdgpu though10:40
gnarfaceeither way, if you're using a graphical desktop make sure you have enough of the mesa packages installed to actually use the driver right... it tends to miss a few by default10:41
cousin_luigignarface: No, it's headless. I would consider changing it only for stability and power consumption purposes.10:42
gnarfaceat one point there was also a "radeonsi" kernel scheduler10:42
gnarfacefor a certain amount of the cards, i think also the ones close to the transition point, that scheduler will make a world of difference in performance, reportedly10:43
gnarfacethat might be stale news10:43
gnarfacei'm not sure if it will matter if you're running headless10:43
cousin_luigiI think it is a southern islands device actually10:43
bob123Hi all . I would be grateful for help or advice. This error breaks Devuan, the system becomes uncontrollable, the cursor moves randomly, the pages themselves close, etc.11:22
bob123This error occurs on qemu-devuan. Local machine -Devuan, virtual machine -Devuan11:22
bob123This is the error11:22
bob123kernel: [362.059103] psmouse serio1: VMMouse at isa0060/serio1/input0 lost sync at byte 111:22
bob1232024-01-29T11:06:02.376836-05:00 bob123 kernel: [ 362.059948] psmouse serio1: VMMouse at isa0060/serio1/input0 - driver resynced.11:22
gnarfacemy first advice for bob123 is to stick around longer12:20
cousin_luigiBy the way, what is the function of acpid if there's no keyboard, nor monitor?13:06
djphcousin_luigi: power control for all the other hardware?13:08
cousin_luigihmm13:08
nemohey folks. I've had the weird situation for a while, that a bunch of my packages have been held. and I'm trying to figure out what triggered it18:26
nemoso.  I tried manually installing one for more of a hint18:26
nemoapt install libsmbclient...18:27
nemoand it said.  "limbsmclient : Depends: samba-libs 2:4.19.4 but 2.4.18 .. etc libtalloc2  2.4.1~ but 2.4.0-f2... libtevent0 0.15.0 but 0.14.1-1... unable to correct you have held broken packages18:27
nemoso. now my question is.  how do I figure out where those versions are coming from?18:28
nemohm. maybe synaptic can magically figure it out for me. it seems good at traversing package graphs sometimes.18:28
nemo... if synaptic is actually working. I've had a lot of trouble with it lately18:28
djphnemo: did you forget to run apt-get update first?18:29
nemonope18:29
nemos/libmbsmclient/libsmbclient/18:30
djphcrossing daedalus and non-daedalus repos?18:32
djphpft, apt-cache depends doesn't give the version18:32
nemodjph: don't think so... checking18:51
nemodjph: nope18:51
nemo4 lines in sources.list  main security updates backports18:52
nemohm. I did have to add non-free to backports though18:52
nemolet's see why18:52
gnarfaceare you aware they moved a bunch of drivers from "non-free" to the new "non-free-firmware" as of daedalus?18:53
nemoit was a deprecated pos that I despise. cylance18:54
nemoremoving it now18:54
nemolet's see if that helps18:54
nemognarface: not sure I need that right now, but definitely good to know. thanks.18:55
nemomaybe if I'm mucking about w/ amd + opencl again and want to avoid a frankenpackage18:55
nemodamn. removing cylance did not help18:55
nemois there any easy way to figure out why stuff is being kept back?18:55
nemoI guess I can just keep running apt install over and over as I descend the packages18:56
nemodeb.devuan.org/merged is correct location right?18:56
djphhm, packages.debian.org says libsmbclient depends on samba-libs 2.4.17 ...18:58
djphoh wait, *backports* though ... that pulls in 2.4.19 ...18:59
djph23 2:4.19.4 ... i'm not seeing a 2:4.18.x anywhere in bookworm's repos though19:00
nemothis is so weird.  I've tried explicitly enumerating each package and it isn't going further down the tree19:08
nemohttps://m8y.org/tmp/devuan_broken.txt19:11
nemoopen to any suggestions19:12
nemolet me see what else is under /etc/apt19:12
nemogoing to try to clean it up a bit19:12
nemoremove some foo~ files etc19:12
nemocould I possibly have old config under /etc/apt/apt.conf.d  that could be a problem?19:13
nemohm. nothing too worrying there, eyeballing it. all seems pretty sensible19:14
nemoI do have an /etc/apt/preferences.d  I don't remember setting called "avoid-systemd" that pins the package systemd-sysv to "Pin: release o=Debian" and "Pin-Priority: -1"19:15
nemoI would hope that isn't an issue19:15
nemoeverything else seems fine. so confused19:16
nemois there a way to make apt regenerate whatever caches of package dependencies it might have?19:16
nemomaybe I can get some clues w/ apt-cache depends19:31
nemoaptitude says my choices are to leave those 3 packages alone19:42
nemoorrrr remove gvfs-backends libldb2 libsmbclient mate-desktop-* python3-smbc and samba-libs19:43
nemoso... I guess I'm leaving those packages alone19:43
nemohopefully it's not a problem19:43
nemofor anyone wondering here. was given a solution that seems to work19:58
nemoapt install -t daedalus-backports libsmbclient19:58
nemoapparently it wasn't using the backports package...19:58
nemoafter running that. nothing stuck19:58
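[Editor's note] This is consistent with how apt ranks Debian-style backports: those suites are normally marked NotAutomatic/ButAutomaticUpgrades, so their packages get pin priority 100 rather than the usual 500 and are not picked as new candidates unless requested with -t. A sketch of how one might inspect this; the function names are hypothetical, while `apt-cache policy`, `apt-mark showhold` and `apt-get -s` (simulate) are standard apt commands. The default suite name is the `daedalus-backports` used above.

```shell
# Show installed and candidate versions with their origins and pin priorities.
show_candidates() {
    apt-cache policy "$@"
}

# List packages explicitly put on hold with apt-mark (often there are none).
show_holds() {
    apt-mark showhold
}

# Dry run: what would apt do if the packages were taken from backports?
# SUITE is an assumption; override it for other releases.
preview_backports_install() {
    apt-get -s install -t "${SUITE:-daedalus-backports}" "$@"
}
```

For example, `show_candidates libsmbclient samba-libs` prints a version table per package, which answers where each version comes from. The "held broken packages" message can appear even when `show_holds` prints nothing, because it also covers dependencies apt cannot satisfy from the candidates it is allowed to pick.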
eyalrozI need your help, my friends...20:58
eyalrozwith initramfs-tools after an apt-get dist-upgrade a week or so ago.20:59
eyalrozwell, first, it was failing because it looked for /sbin/dmsetup and wasn't finding it, since it's under /usr/sbin,20:59
eyalrozbut I've already gotten used to this pattern, so I figured out that was the case and fixed it.21:00
eyalrozBut now, it insists on finding certain rules in /usr/lib/udev/rules.d, which have to do with dm, and those rules don't exist21:01
eyalrozUh, sorry, I meant to say, the rules don't exist under /lib/udev ; but they do exist under /lib/udev/ - which also has its own rules directory21:02
eyalrozThose sets of rules are quite confusing... some of the rules are here, some are there.21:03
eyalrozCan I safely merge them?21:09
eyalrozAnd should I actually merge /usr/lib/udev into /lib/udev entirely?21:09
rwpeyalroz, The current direction is that Devuan will follow Debian with the UsrMerge. This patch has been proposed and will likely be accepted: https://git.devuan.org/devuan/base-files/commit/fc611fab26716b829c3cd85eb3a45d2dcae0d3d421:33
rwpWhich means that if you can rescue the system enough to boot that the way forward is most likely to "apt-get install usrmerge" and then reinstall and dpkg-reconfigure as needed to get the initramfs rebuilt.21:33
eyalrozrwp: So, I managed to get the initramfs-tools to configure21:38
eyalrozAnd my system never stopped booting21:39
eyalrozIt's just that I did some of that usrmerge'ing manually21:39
eyalrozSo I am a bit hesitant about what would happen when I try to install this - with my file moves and symlinks and such21:40
rwpI suggest using the debian-installer netinst iso to boot and then use Advanced->Rescue mode to rescue the system.  Or other favorite boot iso if you like.  But Rescue mode guides through it nicely.21:40
rwpI am rather in the same position.  I have hacked on my system quite a bit too.  So not sure what usrmerge is going to do.  I need to inspect it.21:40
rwpI think all of the work is being done in the postinst script.  So I need to "less /var/lib/dpkg/info/usrmerge.postinst" and browse and inspect it.21:41
rwpThat's after it has been installed on a system.  But I can look at it on another system.  Or "apt-get download usrmerge" and then unpack the .deb and look at the script from the package.21:43
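[Editor's note] The download-and-unpack route can be sketched as below. `apt-get download` fetches the .deb into the current directory without installing it, and `dpkg-deb --control` extracts only the control files (postinst and friends). The helper name and the default output directory are made up for illustration.

```shell
# Fetch a package without installing it and unpack its maintainer scripts.
# Hypothetical helper; usage: fetch_maintainer_scripts usrmerge [output-dir]
# Assumes exactly one matching .deb ends up in the current directory.
fetch_maintainer_scripts() {
    apt-get download "$1" &&
    dpkg-deb --control "$1"_*.deb "${2:-$1-control}"
}
```

After `fetch_maintainer_scripts usrmerge`, the script can be read with `less usrmerge-control/postinst` on any machine, without touching the system that is being repaired.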
nemoFWIW, the whole reason I was attempting to update samba, is still broken21:53
nemobasically I was hitting this:21:54
nemohttps://lore.kernel.org/all/ZZhrpNJ3zxMR8wcU@eldamar.lan/21:54
nemoappears to be a kernel regression pulled in from upstream21:54
nemohttps://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1060005  the debian bug21:54
rwpThe Linux 6 series kernel has had quite a few bugs recently.  I am still pinning to 6.4 which is before other badness for me.  I haven't had time to test newer kernels.21:57
nemoit looks like a kernel maintainer created a build in that bug that people are confirming fixed21:59
nemohow would I check to see if this has made it to "try" server yet?21:59
rwpSorry but I could not parse your question.  Meanwhile...  I suggest booting a previously known good kernel.22:01
nemorwp: he made a build that people tested. my understanding is that before something goes out to backports, it goes through a "try" repository first with debian22:10
nemorwp: I figured I could use that. but. yes this regressed very recently, so I guess I can just switch to a slightly older kernel22:10
rwpLinux 6 has been a rough path.  Good luck!22:15
coyotes4yshelp! i have crowz devuan, openbox, and i keep having it freeze up in various ways: twice today i was playing a video and everything froze including inputs, then once everything froze except the mouse cursor could be moved but clicking did nothing, then a few minutes ago i could still type things in term but everything came back command not found or input error (iirc)23:24
coyotes4yshelp! i'm back23:31
coyotes4ys_anybody here? i really need help23:33
coyotes4ys_AEonFyr, krzych, nckx i'm sorry to call random people but i really need some help on crowz devuan23:35
coyotes4ys_ugh23:36
eyalrozrwp: I don't need a rescue, I boot fine... anyway, I'll try downloading and checking what it does..23:41
rwpWhen that hit me in Unstable my system would not boot due to the corrupted initramfs.  I needed to rescue my system.23:42
coyotes4ys_rescue how23:42
coyotes4ys_thank you rwp how?23:42
rwpHi coyotes4ys!  That comment was intended for eyalroz and our previous discussion.23:43
coyotes4ys_oh23:43
coyotes4ys_i am having a serious prob, system keeps crashing23:43
rwpcoyotes4ys, as to your problem, what graphics adaptor do you have?  (Please don't say nVidia.  Unless it is.  Then do admit to it.)23:44
coyotes4ys__whiskeylake-u gt2 [uhd graphics 620]23:46
coyotes4ys__rwp23:46
rwpThat's the Intel graphics which is in Linux kernel main and should be good.  I am using it too on my other machine and it works okay.  Should be anyway.23:50
rwpAfter a freeze is there anything interesting in /var/log/syslog?  (I always ask but I never find anything about freezes when I am having problems either.)23:51
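[Editor's note] A hard freeze often leaves nothing behind, but a few kernel message fragments are worth searching for first. A sketch; `scan_kernel_errors` is a hypothetical helper, and the pattern list is a starting assumption rather than a complete catalogue.

```shell
# Print numbered log lines that contain common kernel trouble markers.
# Hypothetical helper: $1 defaults to /var/log/syslog.
scan_kernel_errors() {
    grep -nE 'Oops|BUG:|Call Trace|Out of memory|I/O error|blocked for more than' \
        "${1:-/var/log/syslog}"
}
```

If the freeze ended in a reboot, the relevant lines may already be in a rotated file, so running it against /var/log/syslog.1 or /var/log/kern.log as well is reasonable.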
rwpUnfortunately I am in the middle of $WORK stuff and also don't really know what to suggest.  Maybe others in the channel will have better ideas.  Good luck!23:52
coyotes4yshelp me!23:56
coyotes4ysrwp i'm back23:56
coyotes4ysdid you get my answer23:56
coyotes4ysfor anyone who wants to help, first my system was crashing but still displaying desktop etc as before, but inputs wouldn't work or only somewhat worked,23:57
rrqcoyotes4ys: is the disk(s) full?23:58
coyotes4ysthank you rrq! no not full23:58
coyotes4ysthunar and conky show hundreds of gigs free23:58
coyotes4ysalso just now the thing went straight to asus boot options, no bootable os detected! twice this happened, then third time, crowz devuan came back up and i'm in that now!23:59
