This issue is a long time coming. I got a mini pc (Asrock Deskmini h110, i5-6400, 16gb) that I have used for a long time with Kubuntu/Kde Neon, and most of its life, it worked great. Some years ago, it started freezing, especially at Graphic intensive workload, so I thought some hardware issue and converted it into a NAS and it worked absolutely fine as well for a couple of years there too. Recently my wife needed a Windows PC to do some work, and since I had upgraded my NAS, I repurposed the same PC and installed Windows on it, and it worked absolutely fine for her too. Then I decided to check some Graphics intensive workload, like 3d benchmarking stuff, and it didn’t freeze once. I was delighted, and thought maybe I didn’t investigated the issue the first time, and the PC was fine all along. So I reinstalled Debian 13, and lo behold, the issue came back. I found out while I was using IKEA’s 3d kitchen planner. So I replaced distros, and it froze on Ubuntu and CachyOS as well. I tried switching between Wayland and X11, switched browsers, but PC freezes seconds logging into IKEA’s kitchen planner (as soon as 3d graphics are loaded). I reinstalled windows, and my wife has been designing a kitchen in IKEA’s 3d kitchen planner for over an hour now, and it hasn’t frozen once. What’s going on? How do I even investigate this?

I have reinstalled Linux and had sudo dmesg -w running, but no logs are captured before it’s frozen. I have reproduced the issue multiple times now on Linux, and not once it froze on Windows. I have also done memtests, and tried multiple disks both nvme and sata. Also have tried multiple browsers with apt and flatpaks. I really need Lemmy’s collective intelligence to help me here.

Update: Well system stopped hanging with ‘nomodeset’ as boot parameter, confirming that it’s Intel 915 driver. I tried variety of Intel related kernel parameters like psr, dc, guc, even messed with Intel cstate, but it hanged every time Intel 915 driver is even loaded. So ???

  • anon5621@lemmy.ml
    link
    fedilink
    arrow-up
    18
    ·
    1 month ago

    I feel like it possible bug in i915 driver at least it sounds so from symptoms , try add to boot parameters

    i915.enable_psr=0 i915.enable_rc6=0
    

    About log try enable debugfs

    mount -t debugfs debugfs /sys/kernel/debug

  • Tzeentch@lemmy.blahaj.zone
    link
    fedilink
    arrow-up
    20
    arrow-down
    5
    ·
    edit-2
    1 month ago

    Linux is kind of dogshit at memory management unfortunately, due to a thing called overcommit it can essentially commit more memory than there actually exists on the system, cue potential for hard locks, stutters and freezing, there are some mitigations

    Now the common recommendation is brute forcing the issue by making a swap partition or file, this is generally a bad idea on a modern system, due to the speed of ssds and even some fast spinning hard drives its possible for the kernel to get confused and think there’s enough memory spare where it becomes less snappy about clearing memory, often ironically making the problem worse, generally only use a swap partition if you need something else like hibernation

    What should instead be done these days is setting up zram which essentially does compression of your memory in ram so that it stays clear for as long as possible, zswap does something similar but will also fallback to a swap partition soon after, the two are very similar so just setup zram on installs that dont have or need a swap partition, and setup zswap on ones that need or already have a swap partition(as they’re annoying to remove in an existing install)

    But we’re not done yet because while this will buy you a lot of time there is still another massive issue at play, the linux kernel’s oom killer doesn’t generally care how full your memory is unless it starts biting into the kernel’s needs, this means it can take ages before it does anything even if everything is so overloaded where your display server has been hard frozen for 20 minutes, and since it has no real priority outside of the kernel it may force shut off vital processes first rather than secondary ones

    To fix this you want a userspace oom killer that keeps in mind the entire OS and prioritises what needs to go first, many exist but if you want something that just works then EarlyOOM is great

    With these mitigations in place your OS should become much less prone to freezing and stutters, if you really want a nuclear option there’s also cgroups

  • BananaTrifleViolin@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    ·
    1 month ago

    A quick google search shows that many people have issues with the Intel Integrated Graphics on this particular PC (including on Windows) and this seems to be the solution: https://forum.asrock.com/forum_posts.asp?TID=17021&title=solution-for-intel-graphics-freezing-system

    The main solution is Windows based but someone does offer a Linux route to the same solution, although a linked file that sounds like setting it up may be easier is missing. Essentially it looks like the chipset needs tweaking to throttle the GPU slightly to prevent the flaw triggering.

    • atk007@lemmy.worldOP
      link
      fedilink
      arrow-up
      4
      ·
      1 month ago

      Well you sent me down a rabbit hole and for a while I thought it was definitely the issue. I found a script that does what the suggested solution in the link says, and I adjusted iccmax values for GPU, but it doesn’t matter what parameters I adjusted, it kept freezing. I tried BIOS firmware downgrade and updates, changed undervolt parameters and so many other stuff, but didn’t help. In fact it kept freezing without going into graphic intensive work, so I tried reinstalling different distro with different DE, yet all of them froze so much much that I gave up. Reinstalled Windows and it hasn’t freeze once. It’s hard to admit as a Linux evangelist but windows works for some bloody reason where Linux simply fails.

      • Nibodhika@lemmy.world
        link
        fedilink
        arrow-up
        4
        ·
        1 month ago

        It’s hard to admit as a Linux evangelist but windows works for some bloody reason where Linux simply fails.

        Why would that be hard to admit? This is a bug in a specific hardware that has a crappy driver for Linux, Intel doesn’t care about Linux, if Linux was 90% of the market instead of 2% I assure you this bug wouldn’t exist.

        • atk007@lemmy.worldOP
          link
          fedilink
          arrow-up
          4
          ·
          1 month ago

          Because I have been asking my wife to switch to Linux for sometime and one time where she finally agrees, this is what happens. Your reasons are valid but even if I could explain all that to her, she already got one of the worst impressions about Linux with this fiasco. I don’t know how to explain this but this small little thing, this bug, not only has kinda killed my credibility as a tech person in my house, it has also killed the hope of moving away from big corporations towards self hosted services.

  • SSUPII@sopuli.xyz
    link
    fedilink
    arrow-up
    11
    ·
    edit-2
    1 month ago

    Are you sure you are not hitting swap too hard? Windows by default makes a bonkers massive page file, while most distributions try to limit wasting so much for swap.

    If so, try installing SwapSpace

      • Sonotsugipaa@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        1 month ago

        It’s not about the amount of swap space, it’s a problem that happens when swapping happens for big chunks of data at a time.

        Windows aggressively swaps out things way before it’s necessary, you can try increasing the system’s “swappiness”; I’m writing this from my phone, but when I get to my PC I’ll write out how to do it (unless somebody else does it before I do).

        You can set it by writing vm.swappiness=60 in a file like /etc/sysctl.d/50-swappiness.conf.
        The value 60 is arbitrary, if you increase it the system will try to swap out things more aggressively; the name of the file is also partially arbitrary, but AFAIK, it has to begin with two digits — the system will read all the files inside /etc/sysctl.d in order, and the settings in higher-numbered files will be applied over lower ones.

        Officially, this is the explaination of the vm.swappiness parameter.
        You can read and write the value with your shell:

        #!/usr/bin/bash
        sysctl vm.swappiness  # shows you the current value
        sudo sysctl vm.swappiness=69  # sets the swappiness to 69 AND shows you the new value
        
        • atk007@lemmy.worldOP
          link
          fedilink
          arrow-up
          4
          ·
          1 month ago

          Yeah I tried that, didn’t help. I don’t think it’s swap. It has to do with graphics or display driver.

        • TMP_NKcYUEoM7kXg4qYe@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          1 month ago

          This is unrelated but I wonder if I could get better Windows VM performance by disabling swap for the VM. I use an old laptop with slow drive. I wonder if aggressive swapping could be the reason why my Windows machine feels frozen all the time.

          • Sonotsugipaa@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 month ago

            Windows 10 and 11 really dislike HDDs, that’s probably why you can’t admit to using HDDs online without getting stones thrown at you (I’ve been there before).

            I’ve disabled paging files (= swap) for one of my Windows VMs, unfortunately - to my surprise - that only had a small performance boost, and I still need to let the VM chug for a few mintes before it even lets me open File Explorer.

            … but it does improve performance, definitely consider doing it if you don’t need swap/paging/whatever they call it now.

  • data1701d (He/Him)@startrek.website
    link
    fedilink
    English
    arrow-up
    10
    ·
    1 month ago

    When it freeze, after you’ve rebooted it, try running sudo journalctl -p 5 -b -1; you might see something in those logs.

    Maybe also open a task manager before you do anything graphics intensive, just to see if there’s a process that rapidly increases its memory usage; while it might not be the cause, I’ve experienced similar freezes when I use all my memory (on a machine with 32GB of RAM).

  • sneaky@r.nf
    link
    fedilink
    English
    arrow-up
    7
    ·
    1 month ago

    Not sure this is the answer, but going to throw my two cents. If you try and it works let us know.

    When I got back into linux a couple years ago I hopped through all of the distros you mentioned. The last one being KDE Neon. When I first found it I absolutely loved it. Decided that was going to be my main distro and started migrating all my systems which is a couple laptops, gaming desktop, and mini PC.

    Over time I found that I was having minor, but consistent hardware issues. Similar to yours, freezing, and other gpu issues. It was most apparent on systems that had newer hardware. Looking at the specs for your mini PC it seems a bit older than what I have so again, not sure this applies to you, but I found my saving grace in Fedora. My issue specifically was the older kernel Neon uses not interacting well with my newer hardware and in some cases not having access to some hardware features. Fedora had a KDE spin otherwise I wouldn’t have done it. It has been my daily driver on all systems since.

    TL;DR: Try Feodra KDE Spin or any distro that ships with a more up to date kernel

  • malwieder@feddit.org
    link
    fedilink
    arrow-up
    6
    ·
    1 month ago

    journalctl -f (-f is follow mode) or as root user (sudo journalctl -f), open kitchen planner with the terminal visible and look at the (frozen) logs.

  • Tiempo@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    6
    ·
    1 month ago

    Frezze like not even the mouse is responsive? If is a yes, it’s a known bug for some chipboard and can be resolved easy. As soon as I get home I will look at it

  • deadcade@lemmy.deadca.de
    link
    fedilink
    arrow-up
    4
    ·
    1 month ago

    SSH in from another machine, and sudo dmesg -w. If the graphics die, it can’t display new logs on the screen. If the rest of the system is fine, an open SSH session should give you more info (and allow you to troubleshoot further).

    You can also check if the kernel is still functional by using a keyboard with a caps-lock LED. If the LED starts flashing after the “freeze”, it’s actually a kernel panic. You’ll have to figure out a way to obtain the kernel panic information (like using tty1).

    After the “freeze”, try pressing the caps-lock key. If the LED turns on when pressing caps-lock, the Linux kernel is still functional. If the caps-lock key/LED does not work, the entire computer is frozen, and you are most likely looking at a hardware fault.

    From there, you basically need to make educated guesses of what to attempt in order to narrow down the issue and obtain more information. For example, try something like glxgears or vkgears to see if it happens with only one of those, or both (or neither).

    • atk007@lemmy.worldOP
      link
      fedilink
      arrow-up
      2
      ·
      1 month ago

      Well I did dmesg locally and via ssh, and neither showed any message before freezing. Similarly I did journalctl at next boot and no errors there either.

      Capslock doesn’t work, neither can I ping the host. The system is completely frozen and can only be reset with physical power button.

      Surprisingly glxgears worked fine and didn’t led to any freeze.

      • Corngood@lemmy.ml
        link
        fedilink
        arrow-up
        1
        ·
        1 month ago

        Sorry for the duplicate replies. Lemmy server drama…

        That’s a tricky one if you’re getting no info from the kernel. I think the reply above about system instability under load sounds promising. Throttling things down to test seems like a good idea.

  • Corngood@lemmy.ml
    link
    fedilink
    arrow-up
    2
    ·
    1 month ago

    Were you running dmesg on another screen or over ssh or something? I’d look in journalctl -b-1 after a reboot.

    Is it completely frozen or does it respond to pings etc?

    • atk007@lemmy.worldOP
      link
      fedilink
      arrow-up
      1
      ·
      1 month ago

      Did dmesg local and from ssh. Also journalctl also doesn’t give anything. PC is completely frozen as it doesn’t respond to pings at all

  • frongt@lemmy.zip
    link
    fedilink
    arrow-up
    2
    ·
    1 month ago

    Could still be a hardware issue that’s only triggered by certain code paths.

    Update everything, BIOS, firmware, microcode, drivers. If it still freezes, the next step will be swapping out hardware components.

  • Corngood@lemmy.ml
    link
    fedilink
    arrow-up
    2
    ·
    1 month ago

    Were you running dmesg on another screen or over ssh or something? I’d look in journalctl -b-1 after a reboot.

    Is it completely frozen or does it respond to pings etc?

    • atk007@lemmy.worldOP
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      1 month ago

      I did both. Local dmesg and from a ssh session as well, both without any entry on freezing. Journalctl is also with nothing out of the ordinary

      Also it’s completely frozen, no ping, no caps lock etc.

  • cadekat@pawb.social
    link
    fedilink
    arrow-up
    1
    ·
    1 month ago

    What video adapter is in there?

    Have you tried dmesg over SSH? Sometimes you can get an extra message or two if the display freezes before the entire kernel does.