Hi
Before you start posting I want to inform that already tried to solve this problem with our Admin, with companies and professionals in the field, but none of them was able to identify and solve our problem.
To avoid spending more money without having a solution to our problem, I wonder if any friend of the Forum could help us identify the process that is causing problem of disk i/o.
The server has high load during various periods of the day and night, and I could see that when the load is high, the disk i/o is in 100%, as follows:
I got this result monitoring i/o every 15 seconds
We ask our datancenter that check the server disks for errors or problems, but the disks are well second datacenter experts.
Which way do I go now? Remembering that I am not an expert in the subject, I am trying to solve because professionals who have tried have failed.
Thanks
Before you start posting I want to inform that already tried to solve this problem with our Admin, with companies and professionals in the field, but none of them was able to identify and solve our problem.
To avoid spending more money without having a solution to our problem, I wonder if any friend of the Forum could help us identify the process that is causing problem of disk i/o.
The server has high load during various periods of the day and night, and I could see that when the load is high, the disk i/o is in 100%, as follows:
I got this result monitoring i/o every 15 seconds
Now I can not identify the process that is causing this problem, I passed that could be excessive connection problem in Dovecot, Apache or MySQL, but can't seem to find which one is the real problem or even if they are the problem.Time: 02:40:27 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 2.86 0.47 0.27 3.73 2.66 8.73 95.27 34027.00 1363.82 99.95
Time: 02:40:42 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.73 0.47 0.80 4.27 12.80 13.47 98.33 48013.47 789.58 100.01
Time: 02:40:57 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.00 0.47 0.47 3.73 4.80 9.14 99.22 27893.64 1071.57 100.01
Time: 02:41:12 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.00 0.47 0.33 3.73 3.73 9.33 96.26 47267.17 1250.17 100.01
Time: 02:41:27 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.00 0.47 0.27 3.73 2.13 8.00 94.25 44034.09 1363.64 100.00
Time: 02:41:42 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.00 0.53 0.53 6.40 45.87 49.00 89.86 70475.62 937.69 100.02
Time: 02:41:57 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 0.00 0.47 0.27 4.80 2.13 9.45 84.25 54866.55 1363.82 100.01
Time: 02:42:12 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 13.33 187.40 65.93 30.00 1609.60 1849.07 36.05 33.66 7428.83 10.43 100.01
Time: 02:42:27 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 15.40 199.53 91.40 70.67 1420.80 2176.53 22.20 5.98 38.75 6.16 99.86
Time: 02:42:42 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 1.40 120.45 35.38 113.19 353.36 1869.15 14.96 13.91 93.68 3.20 47.48
Time: 02:42:57 AM
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sda 0.00 12.60 0.67 6.80 5.33 155.20 21.50 0.13 16.88 7.71 5.76
We ask our datancenter that check the server disks for errors or problems, but the disks are well second datacenter experts.
Which way do I go now? Remembering that I am not an expert in the subject, I am trying to solve because professionals who have tried have failed.
Thanks