After multiple reflashes, making custom reset scripts, updating firmware, toggling various things - disabling (or rather, not using) CakeQOS has made it stable. Could be coincidence with something else, but its been stable since.
I kept enabling it having seen that it went from an ATM install...