ubuntu20.04莫名死机,日志如下,萌新求救

系统安装、升级讨论
版面规则
我们都知道新人的确很菜,也喜欢抱怨,并且带有浓厚的Windows习惯,但既然在这里询问,我们就应该有责任帮助他们解决问题,而不是直接泼冷水、简单的否定或发表对解决问题没有任何帮助的帖子。乐于分享,以人为本,这正是Ubuntu的精神所在。
回复
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

ubuntu20.04莫名死机,日志如下,萌新求救

#1

帖子 tkffsyl » 2021-12-08 23:16

各位大佬,我的ubuntu总是卡死,隔几分钟就会崩溃一次,鼠标键盘突然没办法使用,只能重启,
刚刚从论坛中学会了看日志,日志是这样的
我猜想还是显卡驱动的问题,但是又害怕系统彻底打不开了,自己已经各种原因重装好几回了,这时间浪费不起了
求大佬看看怎么回事,给指条明路,小弟谢过了。 :Cry
12月 08 22:42:16 wyc kernel: loop2: detected capacity change from 0 to 8
12月 08 22:42:16 wyc systemd[1]: Mounted Mount unit for bare, revision 5.
12月 08 22:43:07 wyc kernel: NVRM: GPU at PCI:0000:02:00: GPU-ead04cf6-7e85-193>
12月 08 22:43:07 wyc kernel: NVRM: Xid (PCI:0000:02:00): 79, pid=0, GPU has fal>
12月 08 22:43:07 wyc kernel: NVRM: GPU 0000:02:00.0: GPU has fallen off the bus.
12月 08 22:43:07 wyc kernel: pcieport 0000:00:1d.4: AER: Corrected error receiv>
12月 08 22:43:07 wyc kernel: pcieport 0000:00:1d.4: PCIe Bus Error: severity=Co>
12月 08 22:43:07 wyc kernel: pcieport 0000:00:1d.4: device [8086:02b4] error >
12月 08 22:43:07 wyc kernel: NVRM: A GPU crash dump has been created. If possib>
NVRM: nvidia-bug-report.sh as root to collect thi>
NVRM: the NVIDIA kernel module is unloaded.
12月 08 22:43:07 wyc kernel: pcieport 0000:00:1d.4: [13] NonFatalErr >
12月 08 22:43:08 wyc systemd[1]: Reloading.
12月 08 22:43:09 wyc systemd[1]: Reloading.
12月 08 22:43:09 wyc systemd[1]: Mounting Mount unit for cloudcompare, revision>
12月 08 22:43:09 wyc kernel: loop3: detected capacity change from 0 to 418832
12月 08 22:43:09 wyc systemd[1]: Mounted Mount unit for cloudcompare, revision >
12月 08 22:43:09 wyc audit[5876]: AVC apparmor="STATUS" operation="profile_load>
12月 08 22:43:09 wyc kernel: audit: type=1400 audit(1638974589.795:30): apparmo>
12月 08 22:43:09 wyc audit[5878]: AVC apparmor="STATUS" operation="profile_load>
12月 08 22:43:09 wyc audit[5877]: AVC apparmor="STATUS" operation="profile_load>
12月 08 22:43:09 wyc kernel: audit: type=1400 audit(1638974589.907:31): apparmo>
12月 08 22:43:09 wyc kernel: audit: type=1400 audit(1638974589.907:32): apparmo>
lines 981-1003/1003 (END)


这里是我的配置 NVIDIA-SMI 470.82.00 Driver Version: 470.82.00 CUDA Version: 11.4 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A |
| N/A 45C P5 N/A / N/A | 593MiB / 2002MiB | 0% Default |
| | | N/A |

NVIDIA Corporation GP108M [GeForce MX250] (rev a1)

ubuntu是20.04
Linux version 5.11.0-40-generic (buildd@lgw01-amd64-010) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #44~20.04.2-Ubuntu SMP Tue Oct 26 18:07:44 UTC 2021
funicorn
帖子: 1301
注册时间: 2005-09-13 4:56
系统: Ubuntu Jammy Jellyfi
送出感谢: 0
接收感谢: 67 次

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#2

帖子 funicorn » 2021-12-09 7:38

你贴的部分不大能看出来
journalctl -p err
重新输出日志看看
这些用户感谢了作者 funicorn 于这个帖子:
tkffsyl (2021-12-14 16:40)
评价: 3.7%
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#3

帖子 tkffsyl » 2021-12-14 16:41

<pre>12月 14 16:37:17 wyc kernel:
12月 14 16:37:18 wyc bluetoothd[733]: <font color="#CC0000"><b>Failed to set mode: Blocked through rfkill (0x12)</b></font>
12月 14 16:37:18 wyc wpa_supplicant[778]: <font color="#CC0000"><b>nl80211: kernel reports: Attribute failed policy validation</b></font>
12月 14 16:37:18 wyc wpa_supplicant[778]: <font color="#CC0000"><b>Failed to create interface p2p-dev-wlp0s20f3: -22 (Invalid argument)</b></font>
12月 14 16:37:18 wyc wpa_supplicant[778]: <font color="#CC0000"><b>nl80211: Failed to create a P2P Device interface p2p-dev-wlp0s20f3</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 9 (Aggregate_power_percentage)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 9 (Aggregate_power_percentage)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 9 (Aggregate_power_percentage)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 9 (Aggregate_power_percentage)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 57 (UKNKNOWN)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 57 (UKNKNOWN)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 57 (UKNKNOWN)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 57 (UKNKNOWN)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported condition 57 (UKNKNOWN)</b></font>
12月 14 16:37:19 wyc thermald[776]: <font color="#CC0000"><b>Unsupported conditions are present</b></font>
12月 14 16:37:21 wyc kernel: <font color="#CC0000"><b>iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707</b></font>
12月 14 16:37:21 wyc kernel: <font color="#CC0000"><b>iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707</b></font>
12月 14 16:37:21 wyc kernel: <font color="#CC0000"><b>iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707</b></font>
12月 14 16:37:21 wyc kernel: <font color="#CC0000"><b>iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707</b></font>
12月 14 16:37:25 wyc bluetoothd[733]: <font color="#CC0000"><b>Failed to set mode: Blocked through rfkill (0x12)</b></font>
12月 14 16:37:32 wyc gdm-password][1377]: <font color="#CC0000"><b>gkr-pam: unable to locate daemon control file</b></font>
</pre>


您看下 我看不懂啊
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#4

帖子 tkffsyl » 2021-12-14 16:47

12月 14 16:37:17 wyc kernel:
12月 14 16:37:18 wyc bluetoothd[733]: Failed to set mode: Blocked through rfkill (0x12)
12月 14 16:37:18 wyc wpa_supplicant[778]: nl80211: kernel reports: Attribute failed policy validation
12月 14 16:37:18 wyc wpa_supplicant[778]: Failed to create interface p2p-dev-wlp0s20f3: -22 (Invalid argument)
12月 14 16:37:18 wyc wpa_supplicant[778]: nl80211: Failed to create a P2P Device interface p2p-dev-wlp0s20f3
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 9 (Aggregate_power_percentage)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 9 (Aggregate_power_percentage)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 9 (Aggregate_power_percentage)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 9 (Aggregate_power_percentage)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 57 (UKNKNOWN)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 57 (UKNKNOWN)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 57 (UKNKNOWN)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 57 (UKNKNOWN)
12月 14 16:37:19 wyc thermald[776]: Unsupported condition 57 (UKNKNOWN)
12月 14 16:37:19 wyc thermald[776]: Unsupported conditions are present
12月 14 16:37:21 wyc kernel: iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707
12月 14 16:37:21 wyc kernel: iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707
12月 14 16:37:21 wyc kernel: iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707
12月 14 16:37:21 wyc kernel: iwlwifi 0000:00:14.3: Unhandled alg: 0x3f0707
12月 14 16:37:25 wyc bluetoothd[733]: Failed to set mode: Blocked through rfkill (0x12)
12月 14 16:37:32 wyc gdm-password][1377]: gkr-pam: unable to locate daemon control file
您看看 我看不懂啊
funicorn
帖子: 1301
注册时间: 2005-09-13 4:56
系统: Ubuntu Jammy Jellyfi
送出感谢: 0
接收感谢: 67 次

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#5

帖子 funicorn » 2021-12-14 19:24

不对呀,你这怎么只有今天的呢,应该有以前日期的才对,这样才知道你死机的时候出了什么问题。
你运行
journalctl -p err > ~/err.log
然后把err.log附件贴上来看看?
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#6

帖子 tkffsyl » 2021-12-15 9:07

err.zip
(6.54 KiB) 下载 12 次
您看下,我是把当天死机后的日志复制出来了。大佬费心了。
funicorn
帖子: 1301
注册时间: 2005-09-13 4:56
系统: Ubuntu Jammy Jellyfi
送出感谢: 0
接收感谢: 67 次

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#7

帖子 funicorn » 2021-12-15 9:29

看了一下,死机应该是和gdm有关,可能是系统文件的用户权限出了问题,这是不正常的,但我也说不清原因。最近是否动过分区或系统文件?

你运行:

代码: 全选

stat / /var /tmp /etc
以及

代码: 全选

env
贴一下输出结果,主要是看看这些路径的uid和gid对不对
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#8

帖子 tkffsyl » 2021-12-17 13:23

File: /
Size: 4096 Blocks: 8 IO Block: 4096 directory
Device: 10306h/66310d Inode: 2 Links: 20
Access: (0755/drwxr-xr-x) Uid: ( 0/ root) Gid: ( 0/ root)
Access: 2021-12-17 21:19:31.283491450 +0800
Modify: 2021-12-08 22:38:48.874332570 +0800
Change: 2021-12-08 22:38:48.874332570 +0800
Birth: -
File: /var
Size: 4096 Blocks: 8 IO Block: 4096 directory
Device: 10306h/66310d Inode: 3932161 Links: 14
Access: (0755/drwxr-xr-x) Uid: ( 0/ root) Gid: ( 0/ root)
Access: 2021-08-19 18:47:19.000000000 +0800
Modify: 2021-12-08 22:38:53.774340983 +0800
Change: 2021-12-08 22:38:53.774340983 +0800
Birth: -
File: /tmp
Size: 4096 Blocks: 8 IO Block: 4096 directory
Device: 10306h/66310d Inode: 2359297 Links: 19
Access: (1777/drwxrwxrwt) Uid: ( 0/ root) Gid: ( 0/ root)
Access: 2021-11-21 20:32:59.414868520 +0800
Modify: 2021-12-17 13:21:11.440218901 +0800
Change: 2021-12-17 13:21:11.440218901 +0800
Birth: -
File: /etc
Size: 12288 Blocks: 24 IO Block: 4096 directory
Device: 10306h/66310d Inode: 4456449 Links: 132
Access: (0755/drwxr-xr-x) Uid: ( 0/ root) Gid: ( 0/ root)
Access: 2021-12-04 22:26:12.130015390 +0800
Modify: 2021-12-08 22:39:03.854372860 +0800
Change: 2021-12-08 22:39:03.854372860 +0800
Birth: -

-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------

SHELL=/bin/bash
SESSION_MANAGER=local/wyc:@/tmp/.ICE-unix/1638,unix/wyc:/tmp/.ICE-unix/1638
QT_ACCESSIBILITY=1
COLORTERM=truecolor
XDG_CONFIG_DIRS=/etc/xdg/xdg-ubuntu:/etc/xdg
XDG_MENU_PREFIX=gnome-
no_proxy=localhost,127.0.0.0/8,::1
GNOME_DESKTOP_SESSION_ID=this-is-deprecated
GTK_IM_MODULE=fcitx
CONDA_EXE=/home/wyc/anaconda3/bin/conda
_CE_M=
QT4_IM_MODULE=fcitx
LC_ADDRESS=zh_CN.UTF-8
GNOME_SHELL_SESSION_MODE=ubuntu
LC_NAME=zh_CN.UTF-8
SSH_AUTH_SOCK=/run/user/1000/keyring/ssh
XMODIFIERS=@im=fcitx
DESKTOP_SESSION=ubuntu
LC_MONETARY=zh_CN.UTF-8
SSH_AGENT_PID=1594
GTK_MODULES=gail:atk-bridge
PWD=/home/wyc
XDG_SESSION_DESKTOP=ubuntu
LOGNAME=wyc
XDG_SESSION_TYPE=x11
CONDA_PREFIX=/home/wyc/anaconda3
GPG_AGENT_INFO=/run/user/1000/gnupg/S.gpg-agent:0:1
XAUTHORITY=/run/user/1000/gdm/Xauthority
WINDOWPATH=2
HOME=/home/wyc
USERNAME=wyc
IM_CONFIG_PHASE=1
LC_PAPER=zh_CN.UTF-8
LANG=en_US.UTF-8
LS_COLORS=rs=0:di=01;34:ln=01;36:mh=00:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=40;31;01:mi=00:su=37;41:sg=30;43:ca=30;41:tw=30;42:ow=34;42:st=37;44:ex=01;32:*.tar=01;31:*.tgz=01;31:*.arc=01;31:*.arj=01;31:*.taz=01;31:*.lha=01;31:*.lz4=01;31:*.lzh=01;31:*.lzma=01;31:*.tlz=01;31:*.txz=01;31:*.tzo=01;31:*.t7z=01;31:*.zip=01;31:*.z=01;31:*.dz=01;31:*.gz=01;31:*.lrz=01;31:*.lz=01;31:*.lzo=01;31:*.xz=01;31:*.zst=01;31:*.tzst=01;31:*.bz2=01;31:*.bz=01;31:*.tbz=01;31:*.tbz2=01;31:*.tz=01;31:*.deb=01;31:*.rpm=01;31:*.jar=01;31:*.war=01;31:*.ear=01;31:*.sar=01;31:*.rar=01;31:*.alz=01;31:*.ace=01;31:*.zoo=01;31:*.cpio=01;31:*.7z=01;31:*.rz=01;31:*.cab=01;31:*.wim=01;31:*.swm=01;31:*.dwm=01;31:*.esd=01;31:*.jpg=01;35:*.jpeg=01;35:*.mjpg=01;35:*.mjpeg=01;35:*.gif=01;35:*.bmp=01;35:*.pbm=01;35:*.pgm=01;35:*.ppm=01;35:*.tga=01;35:*.xbm=01;35:*.xpm=01;35:*.tif=01;35:*.tiff=01;35:*.png=01;35:*.svg=01;35:*.svgz=01;35:*.mng=01;35:*.pcx=01;35:*.mov=01;35:*.mpg=01;35:*.mpeg=01;35:*.m2v=01;35:*.mkv=01;35:*.webm=01;35:*.ogm=01;35:*.mp4=01;35:*.m4v=01;35:*.mp4v=01;35:*.vob=01;35:*.qt=01;35:*.nuv=01;35:*.wmv=01;35:*.asf=01;35:*.rm=01;35:*.rmvb=01;35:*.flc=01;35:*.avi=01;35:*.fli=01;35:*.flv=01;35:*.gl=01;35:*.dl=01;35:*.xcf=01;35:*.xwd=01;35:*.yuv=01;35:*.cgm=01;35:*.emf=01;35:*.ogv=01;35:*.ogx=01;35:*.aac=00;36:*.au=00;36:*.flac=00;36:*.m4a=00;36:*.mid=00;36:*.midi=00;36:*.mka=00;36:*.mp3=00;36:*.mpc=00;36:*.ogg=00;36:*.ra=00;36:*.wav=00;36:*.oga=00;36:*.opus=00;36:*.spx=00;36:*.xspf=00;36:
XDG_CURRENT_DESKTOP=ubuntu:GNOME
VTE_VERSION=6003
CONDA_PROMPT_MODIFIER=(base)
GNOME_TERMINAL_SCREEN=/org/gnome/Terminal/screen/1d26325b_1f00_41bf_b60f_1852ed2b98aa
https_proxy=http://127.0.0.1:7890/
INVOCATION_ID=9cf0fc6fa90b499f9211c52d32c74270
MANAGERPID=1375
CLUTTER_IM_MODULE=fcitx
LESSCLOSE=/usr/bin/lesspipe %s %s
XDG_SESSION_CLASS=user
TERM=xterm-256color
LC_IDENTIFICATION=zh_CN.UTF-8
_CE_CONDA=
LESSOPEN=| /usr/bin/lesspipe %s
USER=wyc
NO_PROXY=localhost,127.0.0.0/8,::1
GNOME_TERMINAL_SERVICE=:1.96
CONDA_SHLVL=1
DISPLAY=:1
SHLVL=1
LC_TELEPHONE=zh_CN.UTF-8
HTTPS_PROXY=http://127.0.0.1:7890/
HTTP_PROXY=http://127.0.0.1:7890/
QT_IM_MODULE=fcitx
LC_MEASUREMENT=zh_CN.UTF-8
http_proxy=http://127.0.0.1:7890/
CONDA_PYTHON_EXE=/home/wyc/anaconda3/bin/python
XDG_RUNTIME_DIR=/run/user/1000
CONDA_DEFAULT_ENV=base
ALL_PROXY=socks://127.0.0.1:7891/
LC_TIME=zh_CN.UTF-8
JOURNAL_STREAM=8:48230
XDG_DATA_DIRS=/usr/share/ubuntu:/usr/local/share/:/usr/share/:/var/lib/snapd/desktop
all_proxy=socks://127.0.0.1:7891/
PATH=/home/wyc/anaconda3/bin:/home/wyc/anaconda3/condabin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin
GDMSESSION=ubuntu
DBUS_SESSION_BUS_ADDRESS=unix:path=/run/user/1000/bus
LC_NUMERIC=zh_CN.UTF-8
_=/usr/bin/env


您来看下,分区应该没动过。
funicorn
帖子: 1301
注册时间: 2005-09-13 4:56
系统: Ubuntu Jammy Jellyfi
送出感谢: 0
接收感谢: 67 次

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#9

帖子 funicorn » 2021-12-17 15:06

嗯,这部分是没问题的。看来GDM的报错信息是重启前发生的,不是死机的原因。

很遗憾帮不了你了。结合你说的情况和日志像是内核错误,很难诊断。

怀疑是nivida驱动问题的话,可以试试从470降回460,把连接线从DP换为HDMI,或者改用wayland环境。另外下次死机时,可以试试从ssh登陆,看看能不能重启gdm。
tkffsyl
帖子: 7
注册时间: 2021-12-08 23:06
系统: linux
送出感谢: 2 次
接收感谢: 0

Re: ubuntu20.04莫名死机,日志如下,萌新求救

#10

帖子 tkffsyl » 2021-12-17 21:17

谢谢您了 实在不行我就在重装一下了
回复