I'm wondering how I could monitor spinlocks. At my client, we have cpu soft lockup failure, for which, if I understand well, spinlock is a likely cause.
Different team use that server for predictive modeling using R, Python and SAS, meaning we often have many unsupervized processes running in parallel, possibly with multiprocessing librairies.
Monitoring the number of spinlocks or, even better, which processes used them, might help in validating or invalidating them as a cause for our frequent failures (5 during the last 3 weeks).
Is there any way to monitor them? If not, how could we know what would be causing those soft lockups?
Asked by laurent exsteens
(111 rep)
May 31, 2016, 08:39 AM
Last activity: May 31, 2016, 02:07 PM
Last activity: May 31, 2016, 02:07 PM