High CPU spikes
Posted: Fri Jul 05, 2024 12:57 pm
Hi
We have an ongoing problem with our server that runs Switch.
Out of nowhere, we can get CPU spikes up to 100% which then stop everything from working.
At this point, Switch will be taking up a high portion of the CPU and many other processes seem to then be accounting for the rest.
In normal working times, the CPU will sit anywhere in the 5-15% area with Switch taking a small part of that.
We then have to pause the processing (140 flows), stop Switch and reboot the server. Sometimes this works. Other times, we have to do this and then leave the machine to just settle down which it will do after a while.
Other times we have to stop all flows, and start then individually until all is back up and running.
I can't see any logic as to what is going on, or what causes it. It happens randomly by the look of it.
I am not sure if / how we can tell if it is actually Switch causing the problem OR is there something else happening on the server that's causing the slow down, and then causing Switch to slow down i.e. network speed?
Any help/suggestions as to the best way to approach resolving this would be greatly appreciated.
Our IT guys have suggested upping the spec of the server, but considering it will be running fine for days / weeks with no problem and then just spikes out of nowhere, I can't see this will help? If it was a server resource issue, I would assume it would be slow all the time and not just a sudden jump?!
Thanks for any help with this. It's getting very frustrating at the moment as we are wasting hours of time each instance of this happening.
We have an ongoing problem with our server that runs Switch.
Out of nowhere, we can get CPU spikes up to 100% which then stop everything from working.
At this point, Switch will be taking up a high portion of the CPU and many other processes seem to then be accounting for the rest.
In normal working times, the CPU will sit anywhere in the 5-15% area with Switch taking a small part of that.
We then have to pause the processing (140 flows), stop Switch and reboot the server. Sometimes this works. Other times, we have to do this and then leave the machine to just settle down which it will do after a while.
Other times we have to stop all flows, and start then individually until all is back up and running.
I can't see any logic as to what is going on, or what causes it. It happens randomly by the look of it.
I am not sure if / how we can tell if it is actually Switch causing the problem OR is there something else happening on the server that's causing the slow down, and then causing Switch to slow down i.e. network speed?
Any help/suggestions as to the best way to approach resolving this would be greatly appreciated.
Our IT guys have suggested upping the spec of the server, but considering it will be running fine for days / weeks with no problem and then just spikes out of nowhere, I can't see this will help? If it was a server resource issue, I would assume it would be slow all the time and not just a sudden jump?!
Thanks for any help with this. It's getting very frustrating at the moment as we are wasting hours of time each instance of this happening.