↑ Return to D00 Server

D05 Server Issues


Page no: D05

Common Server Issues

 

 Explanation

Video and Pics

Provider / Hardware Issues

Hetzner Internet Down

SNBCHF and all sites down. Dec 20, 2017.

Hetzner down. The downtime was 20 minutes

 

We can’t do anything in these cases. They do not provide any details for the problem.

All sites down

SNBCHF and all sites down. Jul 02, 2019.

After a 2 hour investigation early in the morning, the temperature of the server was to high and it stop responding. This was a hardware problem. In these cases, we need to contact support and explain what is the problem. You send them a ticket using a special support page.

Hetzner usualy needs 5 – 15 minutes to take care about the server, if you give them a proper brief what is the problem. In our case it was the CPU Fan. They give you the reason of the problem and what were the solution by email.

Harddisk Broken

One of the hard drives of the server was broken. Hetzner replace it with new one. Then we did a sync.

Low Space

No Space left on disk

When we store many backups on the server, the disk space is full and the site stops working. The site needs always some disk space for properly working.

The problem: We simply need to delete the old backups from the server.

Deletion can be done via BackupBuddy page or via FTP browsing wp-content/uploads. Then we need to delete the old backups and after 1 minute all the site services goes up again.

No Disc Space
D08 No Disc Space

- Click to enlarge

Delete Backup
D08 Backup

- Click to enlarge

Purging cache

Yes, It is not good idea to purge our caches everyday.

Do not purge cache every day.

Purge cache
Purge cache

- Click to enlarge

High CPU / Resources

Usually during the Crons from

Slow Template

Too many plugins: They make high activity.

Get rid of some plugins or improve template.

 

Many sites

When we have too many sites, then the performance may get bad.

Remove sites.

 

 

Yoast Term Optimizer

Yoast Term Optimizer does not work

 

It stops working due to lack of enough resources for this single process

 

The system guy makes the memory, executing time and other resources higher for single process.

 

Yoast Term Optimizer Resourcese Problem
D08 Yoast Term Optimizer Resourcese Problem

- Click to enlarge

Too Many Crons

We currently have an issue with the server again. Due to too many crons.

  • Lack of resources due to too many crons running
  • Stop some of them
  • Review the crons

Which Crons got deactivated –> Crons page

Crons SNBCHF
Crons SNBCHF

- Click to enlarge

CDN purge

Our server problem on 07/20 was caused by our CDN. We purged the whole cdn cache and their crawlers start caching all our static content again. The problem was that they started too many connections to the our server. The traffic was almost 800% higher than the usual traffic. This is problem of the CDN, they must send regular queries for fetching our content. This traffic is not seen by the Google Analytics, because it is directly to our CSS files. Nothing related with the site. So Google Analytics can track only when the code is installed on that page. It can’t track the traffic to our static files.

We fixed that issue changing the apache and nginx limits to higher. Now we can handle MaxCDN traffic. But it is good idea to purge only some files, not the whole cache. It will save us so many unnecessary queries.

Server load at that time on 07/20

Server load at that time on 07/20 - Click to enlarge

Pagination

We had a performance problem caused by missing pagination

We introduced pagination, for more see here.

External Activities

Bad bots on ecoblogs

I was looking on the server logs last two days and see that the performance issues are from bots or bad users from China and Turkey. About 10 Ip addreses are making 70% of all performance. They are making about 10 queries per sec. We need very urgent to revise the logs, get all bad bots and users and ban from our sites. This will lead to better performance, too.

Bad bots on ecoblogs
Bad bots on ecoblogs

- Click to enlarge

Bad Chinese bots

 

date: June 17, 2020

All sites are not responding. Services like Apache, MySQL, NGNIX do not work any more. Possibly Denial of Service attack.

We notice Chinese bots when we look at the statistics Unresolved IP that open the website. Currently the most visited ip addresses are ours. So no problem.

These IP addresses are banned via htaccess file.

Bad Chinese bots

.

How to ban IP addresses via htaccess

 

Block what
specific IP address Deny from 123.123.123.123
specific domain Deny from 123.123.123.123/255.255.255.0
multiple IP addresses Deny From 111.111.111.111 222.222.222.222 333.333.333.333
Block entire subnet Deny from 123.123

Then I start looking what is causing the problem. The problem was caused by economicblogs.ch and one infinity loop caused by one avatar.

The problem was because the name of the image contains utf-8 special symbols #U2013 and this was the reason for infinity loop. We changed the avatar with different and everything goes to normal.

Server improvements

Server update for better improvement

Hetzner microcode of the processor

As an PX61, PX61-SSD or PX61-NVMe Dedicated Root Server client, we are
writing to inform you that recently a problem was found in the microcode
of the processor used in these models.

The following servers may be affected:
PX61-NVMe #681263 (88.99.241.248) – SNBCHF and Blogs

For further information on this topic, please check the following
resources:

Debian
Tom’s Hardware

Ars Techinca

Based on current information, the problem is triggered by specific
micro-architectural command sequences on Intel’s 6th or 7th generation
CPUs with Hyper-Threading enabled. It can, in rare cases, cause
unpredictable behavior and, in more rare cases, data loss.

 

 

You can fix the problem by installing the latest Intel microcode update. For
several Linux distributions, you can find the necessary microcode update in the
repositories of your distribution. You can find more information regarding this
topic in our company wiki. (Please see link below.)

If you are using any other operating system (e.g. Windows), or if there
is no update for your Linux distribution yet, you can install the
microcode update by updating your BIOS.

Option: Bios Update by Hetzner

If you would like a BIOS update, please follow these steps:

Write a support ticket for every server that is affected. If you have a preferred time for
when our technicians should perform the update, make sure to include that in
the ticket. You may experience up to 30 minutes of downtime. In the support
ticket, you need to also clearly state that you agree to the downtime.

As an alternative, you can do the BIOS update yourself by following the
instructions in our company wiki:

 

If you perform the update on your own, and you cannot reach your server for
more than 10 minutes after completing the update, please do NOT perform a
reset. Instead, please write us a support ticket.

Summary:

  • We did a BIOS update, because old version of BIOS.
  • microcode update on the server

Estimate: 3 hours

Effort: 3 and a half hours

Weak Resources

How can they have 90 million articles without performance issues?

beforeitsnews.com

onenewspage.com


  1. Bigger servers. This is very important. They have very big servers, so the performance issues are not a problem for them.
  2. Ecoblogs is slow: so why don’t we have bigger servers?
    We can have big servers, but this will costs more money per month.
  3. Cache (see W53)
  4. Template and plugins (see below). What does this mean?

 

Buy bigger server.

 

 

 

Missed scheduled posts

Due to some bugs in WP cron, some of the posts were not published.

Missed scheduled posts

.

Posts Status

.

We installed the WP Missed Schedule plugin taking advice from this article.

They suggest to use this plugin for such a problems and the plugin will take care of everything. We will keep it for a while and if it is useful, it will remain.

WP Missed Schedule Posts

.

 

See more for Design