Sample Header Ad - 728x90

Repeated server shutdown with same command - Unable to diagnose

0 votes
0 answers
62 views
**Problem** I have an unknow critical issue with my personal server, that causes shutdown (server turn off) I tried to watch severals elements, that i'll try to summarize here, with still no clues about the crash reason. As a (forced) beginner in server crash analysis, I am having difficulty finding other relevant test to investigate **Context** I'm working with crewai python librarie, and running the main command uv run crewai run causes me a systematic server complete shutdown. The time between executing the command and the crash varies but is mostly less than 10 seconds. In the longest cases, it can be a few minutes. I have also previously hosted a game server and experienced shutdowns even though the service was running with Docker with RAM/CPU limits. I cannot be certain that the cause is the same, but it seems probable. **My configuration** BMAX-Processeur B5A Pro AMD Ryzen 7 5825U, AMD Radeon vec, 16 Go, DDR4, 512 Go, SSD NVMe. Linux with Ubuntu server; Ubuntu 24.04.2 LTS release 24.04 (codename noble) **What I tried** - GPU / CPU / RAM
stress-ng --cpu 16 --vm 2 --vm-bytes 15G --gpu 16 --timeout 600s
--> No crash -
observation during crash command --> Far from max values (~33% RAM, some cores used, low swap usage) - Temperature sensors watch
-n 0.5 sensors
--> Temperature after stress test are higher than last moment before the crash ( Nothing apparent about the crash reason - Last boot logs
-b-1
--> Nothing apparent about the crash reason - Observation of
--> Nothing apparent about the crash reason - RAM test with
--> Test passed Don't hesitate to ask me for more details, information, logs, ... if needed, and thanks in advance for any help.
Asked by AnchorInk (1 rep)
Feb 15, 2025, 11:23 PM
Last activity: Feb 15, 2025, 11:42 PM