Using top to identify CPU core number with qsub (PBS Pro)

0 votes
1 answer
554 views
I have a shared-memory server with PBS Pro installed as the job manager. There are 320 cores in total, and PBS is configured with a single job queue that has 30 of the 32 CPUs, so 300 physical cores are divided among users and the remaining 2 CPUs (20 cores) are left for head-node operation.

Some software will run on, or parallelize over, 100+ cores, and I'd like to verify that the N processes from user A are on core numbers separate from those used by the M processes from user B.

In top, I can press F to open the sort menu, then J to sort by "P = Last CPU used (SMP)". This gives me a P column in the top output showing the core number. But on a 300-core system I can only get around 70-80 rows before I run out of screen on a 1920x1200 monitor. I can shrink the font size in the terminal window, but I still won't be able to see all 300 rows, and I couldn't visually process that much anyway with top updating every 1 to 3 seconds.

My goal is to quickly and easily:

- verify that users are running their jobs from the job queue on the correct core numbers, and not on the head node
- verify that for any given CPU core at 100% (or anything over 50%), only one process from one user is running on it

I want to make sure that if user A with programA.x is on core #234, user B with anything.x is NOT on core #234.

What's the best way to do this when a single-image shared-memory server has this many cores?
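To make the two checks concrete, here is a minimal Python sketch of the kind of non-interactive report I have in mind. It is only an illustration under several assumptions: it relies on the procps `ps` output fields `psr` (the processor a process is currently assigned to) and `pcpu`; the function names and `HEAD_NODE_CORES` are hypothetical; and the guess that the head node's 20 cores are numbered 0-19 is a placeholder that would have to be checked against the real topology. Note also that `pcpu` in `ps` is averaged over the process lifetime rather than being an instantaneous reading like in top, and that `psr` is reported per process, so multithreaded programs would need a per-thread listing instead.

```python
import subprocess
from collections import defaultdict

# Assumption: the 20 head-node cores are numbered 0-19. Adjust to the real layout.
HEAD_NODE_CORES = set(range(20))


def parse_ps(text):
    """Parse lines of 'user pid psr pcpu comm' into tuples, skipping bad lines."""
    rows = []
    for line in text.splitlines():
        parts = line.split(None, 4)
        if len(parts) < 5:
            continue
        user, pid, psr, pcpu, comm = parts
        try:
            rows.append((user, int(pid), int(psr), float(pcpu), comm))
        except ValueError:
            continue  # e.g. a header line or a non-numeric field
    return rows


def shared_cores(rows, min_cpu=50.0):
    """Return {core: [users]} for cores where more than one user has a busy process."""
    users_by_core = defaultdict(set)
    for user, _pid, psr, pcpu, _comm in rows:
        if pcpu >= min_cpu:
            users_by_core[psr].add(user)
    return {core: sorted(users)
            for core, users in users_by_core.items() if len(users) > 1}


def on_head_node(rows, head_cores, min_cpu=50.0):
    """Return the busy processes whose last-used core is one of the head-node cores."""
    return [r for r in rows if r[2] in head_cores and r[3] >= min_cpu]


def snapshot():
    """Take one snapshot of all processes using procps ps."""
    out = subprocess.run(
        ["ps", "-eo", "user:32,pid,psr,pcpu,comm", "--no-headers"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_ps(out)


if __name__ == "__main__":
    rows = snapshot()
    for core, users in sorted(shared_cores(rows).items()):
        print(f"core {core}: shared by {', '.join(users)}")
    for user, pid, psr, pcpu, comm in on_head_node(rows, HEAD_NODE_CORES):
        print(f"head-node core {psr}: {user} pid={pid} {comm} ({pcpu}% CPU)")
```

Because this prints only the cores that violate one of the two conditions, the output should stay short even with 300 cores, which avoids the screen-size problem I run into with top. System daemons that legitimately run on the head-node cores would also show up in the second list, so that list would probably need filtering by user.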
Asked by ron (8647 rep)
Mar 28, 2018, 08:24 PM
Last activity: Apr 17, 2018, 11:44 AM