Hi Jorge,
I didn't see your replies until today! I guess I got some clarity from our Bugzilla discussions.
On Fri, Mar 16, 2018 at 09:43:04AM +0000, Jorge Martínez López wrote:
I did some research and found the following kernel bug:
https://bugzilla.kernel.org/show_bug.cgi?id=196683
Fedora has CONFIG_RCU_NOCB_CPU=y in the 4.15.8 kernel configuration, so I added "rcu_nocbs=0-7" to the boot parameters, and it has been running stably for a while. I have also added "nopti", as there is some anecdotal evidence that it improves stability, but I'm not sure about that.
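One way to confirm that such parameters actually reached the running kernel is to read /proc/cmdline. The following is only a minimal Python sketch: the function names are hypothetical, the tokenizer ignores quoted values containing spaces, and the CPU-list expansion handles only plain numbers and ranges such as "0-7".

```python
from pathlib import Path


def parse_cmdline(cmdline: str) -> dict:
    """Split a kernel command line into {name: value}; bare flags map to None."""
    params = {}
    for token in cmdline.split():
        # Split on the first "=" only, so "root=UUID=abc" keeps "UUID=abc" as value.
        name, sep, value = token.partition("=")
        params[name] = value if sep else None
    return params


def expand_cpu_list(spec: str) -> list:
    """Expand a simple CPU list such as "0-3,6" into [0, 1, 2, 3, 6]."""
    cpus = []
    for part in spec.split(","):
        first, sep, last = part.partition("-")
        cpus.extend(range(int(first), int(last if sep else first) + 1))
    return cpus


def main() -> None:
    path = Path("/proc/cmdline")  # Linux only
    if not path.exists():
        print("no /proc/cmdline on this system")
        return
    params = parse_cmdline(path.read_text())
    if "rcu_nocbs" in params:
        print("rcu_nocbs CPUs:", expand_cpu_list(params["rcu_nocbs"]))
    else:
        print("rcu_nocbs is not set")
    print("nopti present:", "nopti" in params)


if __name__ == "__main__":
    main()
```

If the script reports that rcu_nocbs is not set after a reboot, the parameter was most likely added to a boot entry other than the one that was started.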
I think my tracebacks are very different. That said, it also seems to me I'm having freezes due to several unrelated reasons, and AMDGPU is probably one among many.
There is also some discussion on the bug page about old PSUs not providing good enough low voltage. AMD is recommending a newer (post-Haswell) PSU, but for the time being the boot config is working for me.
This is an interesting point, but very difficult to test :-|.
I haven't been able to debug my issues successfully as nothing useful really shows up in the journal. I was hoping someone could suggest a way so that I could get more information to file a more specific bug report.
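For context, looking at the journal after a freeze usually means reading the kernel messages of the previous boot. A minimal Python sketch of that step follows. It assumes a persistent journal (otherwise earlier boots are not kept), the function names are hypothetical, and it will show nothing new if the machine froze before the messages were written to disk.

```python
import shutil
import subprocess


def journal_command(boot_offset: int = -1, priority: str = "warning") -> list:
    """Build a journalctl command for the kernel messages of one boot."""
    return [
        "journalctl",
        "--no-pager",
        "--dmesg",                  # kernel messages only
        f"--boot={boot_offset}",    # -1 is the boot before the current one
        f"--priority={priority}",   # this priority and anything more severe
    ]


def last_kernel_messages(count: int = 40) -> list:
    """Return the last `count` matching lines, or [] if journalctl is missing."""
    if shutil.which("journalctl") is None:
        return []
    result = subprocess.run(
        journal_command(), capture_output=True, text=True, check=False
    )
    return result.stdout.splitlines()[-count:]


if __name__ == "__main__":
    for line in last_kernel_messages():
        print(line)
```

The tail of the output is the interesting part, since it holds whatever the kernel managed to log shortly before the freeze.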
Any thoughts, anyone?
TIA,