March 27, 2023

The Complete Technical SEO Checklist That Every Marketer Needs This 2023

Using Google Analytics or Bing Webmaster Tools is only a small part of the strategy for becoming one of the pages that matter in search results.

You have many checklists at your disposal for improving organic traffic:

  • Link building checklist
  • Keyword research checklist
  • On page SEO checklist
  • Technical SEO checklist

Today, we'll focus on the technical SEO checklist.

In an increasingly competitive digital landscape, performing a comprehensive technical SEO audit is crucial for optimizing your website's performance and improving search engine rankings.

What exactly is a technical SEO audit?

A Technical SEO audit is a process meant to identify and fix the technical issues that could make it more complicated for Google to crawl, index, and rank your site.

Poor site architecture, broken links, slow page load times, and display problems on mobile devices are issues frequently uncovered during a technical search engine optimization audit.

The technical SEO audit is an essential part of a website's SEO strategy and should be one of the first things you look into to improve your visibility in Google search results.

Why Is a Technical SEO Checklist Necessary?

Technical SEO audits are crucial because, even if you've spent a lot of time producing excellent content, problems with your website's crawlability and indexability may prevent users from seeing it.

However, even if users can find your site online, performance-related technical factors may degrade its rankings. Because page load speed is a ranking factor, a slow website is less likely to appear near the top of SERPs (search engine results pages).

Internet users are even less patient than Google crawlers and will leave your website if it takes too long to load.

Similarly, a poorly organized website can cause confusion among visitors. An easy-to-navigate website improves the user experience, which produces more leads.

During a technical SEO audit, you might learn that mobile users encounter many issues when navigating your website. Such problems could result in a terrible loss of revenue, given that mobile devices account for more than half of all web traffic worldwide.

This ultimate technical SEO checklist for 2023 will guide you through every essential aspect of technical SEO to ensure your site stays ahead of the game.

Your Ultimate Technical SEO Checklist from A to Z!

A. Set up a robots.txt file to guide search engines in crawling and indexing your websites.

When a web crawler visits your site, the first thing it looks for is your /robots.txt file, also known as the Robots Exclusion Protocol. This protocol lets you allow or prohibit specific web robots from crawling your website, including particular sections or even individual pages. To stop bots from indexing a page, use a noindex robots meta tag instead. Both scenarios are discussed below.

Sometimes it's best to prevent specific bots from ever crawling your website. Unfortunately, some bots on the internet have bad intentions: they will scrape your content, flag your site for duplicate content, or spam your community forums. If you notice this kind of bad behavior, use your robots.txt file to keep those malicious bots out.

Regarding indexing, search engine robots crawl your website to gather information and identify keywords that help them match your web pages with relevant search queries. You have a limited crawl budget, though, and you want to avoid blowing it on pages that don't matter, as we'll cover later. Therefore, pages such as a thank-you page, an offer confirmation page, or a login page that don't help search bots understand what your website is about can be excluded.

Exactly what goes into your robots.txt file will differ depending on what you want to achieve.
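
For instance, a minimal robots.txt might look like the following sketch. The paths and bot name are hypothetical placeholders, not values taken from any particular site:

    # Keep all crawlers out of pages that don't help them understand the site
    User-agent: *
    Disallow: /thank-you/
    Disallow: /login/

    # Block a known bad bot from the entire site
    User-agent: BadBot
    Disallow: /

    # Point crawlers at your sitemap
    Sitemap: https://www.example.com/sitemap.xml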

B. Create an XML sitemap to help search engines discover and index your content.

An XML sitemap lays out your site structure and helps search engines understand and index your pages. Consider it a map of your website. Upload your sitemap to Bing Webmaster Tools and Google Search Console as soon as it's ready, and remember to keep it current as you add and remove pages.
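
A bare-bones XML sitemap looks something like this (the URLs and dates are placeholders):

    <?xml version="1.0" encoding="UTF-8"?>
    <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
      <url>
        <loc>https://www.example.com/</loc>
        <lastmod>2023-03-01</lastmod>
      </url>
      <url>
        <loc>https://www.example.com/blog/</loc>
        <lastmod>2023-03-20</lastmod>
      </url>
    </urlset>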

C. Avoid Index Bloating

Index bloat occurs when search engines index an excessive number of low-quality pages on your website. Processing this excess content makes a search engine's information retrieval less effective, an outcome comparable to bloating in the human digestive system.

Index bloat can make your life difficult without you even knowing it. To reach the pages you actually want indexed, Google has to wade through far more content than necessary in this puffy, uncomfortable situation, and that extra content is frequently low-quality or internally duplicated.

Imagine that Google visits your XML sitemap to discover 15,000 pages, then crawls all of your pages to find even more via internal linking, and finally chooses to index 50,000 URLs.

But don't worry: checking your indexation rate for index bloat can be quick and easy. Simply compare the pages you want indexed with the pages Google is actually indexing.

The goal is to find that disparity and use it to guide the best course of action. There are two options:

Good-quality content = Maintain indexability.

Low-quality content (thin, redundant, or paginated) = Apply noindex so it drops out of the index.

You will often discover that index bloat is resolved by adding a noindex meta tag, which removes a sizable number of pages from the index. This indexation analysis can also uncover pages that were overlooked when creating your XML sitemap; add them to your sitemap for better indexing.

You can also employ your robots.txt file to block crawling of specific parts of your website. This is the best option if you want to exclude whole sections of your site, such as internal search pages (if they all look alike), reviews, size charts, orphaned pages, paginated pages, and so on.
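
As a rough sketch, blocking those kinds of sections looks like this in robots.txt (the directory names are hypothetical and depend on your own URL structure):

    User-agent: *
    Disallow: /search/
    Disallow: /size-charts/
    Disallow: /reviews/

Keep in mind that robots.txt blocks crawling rather than indexing, so for pages you want actively dropped from the index, the noindex meta tag described above is the more direct tool.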

However, the best thing you can do is make sure you produce original content whenever you can.

D. HTTP Tags

HTTP is the protocol used to fetch resources such as HTML documents. It is the basis for all data exchange on the Web and is a client-server protocol: requests are initiated by the recipient, typically a web browser. A complete web page is reconstructed from the various sub-documents it fetches, such as text, layout descriptions, scripts, images, videos, and more.
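
To make this concrete, a simple HTTP exchange between a browser and a server looks roughly like this (example.com is a placeholder):

    GET /index.html HTTP/1.1
    Host: www.example.com
    User-Agent: Mozilla/5.0

    HTTP/1.1 200 OK
    Content-Type: text/html; charset=UTF-8
    Content-Length: 5120

    <!DOCTYPE html>
    <html> ... </html>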

E. Serving www and non-www Pages

A domain without the www is sometimes called a "naked" domain. A www domain, on the other hand, functions as a hostname and can be connected to multiple subdomains.

Having a canonical domain means you have designated it as your "official" domain in the eyes of search engines. Selecting one version (and using it consistently) gives search engines and users alike a more dependable and trustworthy impression.

The www prefix, originally just another subdomain in the early days of the internet, was the conventional starting point for website URLs. Eventually, people began typing website names without the www when searching online. Webmasters noticed this pattern and simplified their domain strategies accordingly.

Functionally, a www URL is no different from a non-www URL. Technically, however, there is a slight distinction between the two.

Search engines see them as distinct entities, which is the key thing you need to know.
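
Whichever version you choose, redirect the other one to it with a permanent (301) redirect. Here is a minimal sketch assuming an Nginx server and a preference for the www version; example.com is a placeholder, and Apache has an equivalent rewrite rule:

    # Redirect the naked domain to the canonical www domain
    server {
        listen 80;
        server_name example.com;
        return 301 https://www.example.com$request_uri;
    }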

F. Empty category pages in online commerce.

Your e-commerce website's category pages (or product listing pages) may occasionally end up empty when you no longer have any matching products.

For instance, a sold-out event, a technical problem, or improper back-office handling could be the cause.

The HTTP status code may change (to a 301 or 404) or remain a 200, depending on your CMS, the optimizations in place, and the SEO decisions made.

Tools such as OSEOX Sitemap and OSEOX Monitoring will notify you if the HTTP status code changes.

If the status code remains at 200, it's crucial to understand that the page is still empty, even though keeping a 200 here is frequently good for SEO.

G. eCommerce Website Sorting & Views

On e-commerce websites, users can narrow down the products displayed in a given category by using filters, which apply parameters from a predefined list. The filters used most frequently in e-commerce are:

  • Brand.
  • Colour.
  • Size.
  • Type.
  • Price.

Filtering options in online stores typically vary depending on the names and specializations of a particular brand or category. This solution undoubtedly improves the user experience because it saves users from having to browse the entire product selection.

Sorting options, in contrast, let you arrange the products in a category by specific criteria without changing the number of items displayed, and they can also be applied to already-filtered results. These sorting options are used the most frequently (an example of how filters and sorting appear in URLs follows the list):

  • Price.
  • Customer reviews.
  • Popularity.
  • Sales and new arrivals.
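
Filters and sorting options usually end up as URL parameters, which can generate a huge number of near-duplicate URLs for the same category. A common pattern (sketched here with hypothetical parameter names) is to point those variants back at the clean category URL with a canonical tag, covered in more detail further down:

    Filtered or sorted variants:
    https://www.example.com/shoes?colour=red&size=42
    https://www.example.com/shoes?sort=price-asc

    Canonical tag placed in the <head> of those variants:
    <link rel="canonical" href="https://www.example.com/shoes">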

H. Meta Robots Tag

Robots meta directives, also known as "meta tags," are snippets of code that tell web crawlers how to crawl or index a page's content. Unlike robots.txt directives, which offer broad suggestions for how bots should crawl a website's pages, robots meta directives provide more detailed, page-specific instructions on how to crawl and index a page's content.

Robots meta directives come in two flavours: those embedded in HTML pages (the meta robots tag) and those sent by the web server as HTTP headers (the x-robots-tag). Both support the same specifications (i.e. the crawling and indexing directives, such as "noindex" and "nofollow"); what differs is how crawlers are informed of those parameters.

Meta directives give crawlers instructions on how to crawl and index the content they discover on a webpage. However, just as with robots.txt, these parameters are only strong recommendations: crawlers are not obliged to abide by your directives, and some malicious web robots will likely disregard them.

The parameters used in robots meta directives, and which search engine crawlers adhere to, are not case-sensitive, but be aware that some search engines may only follow a subset of them or may apply certain directives slightly differently.
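
For reference, a meta robots tag sits in the page's <head>. A couple of common forms (the directives shown are standard; whether you need them depends on your site):

    <!-- Applies to all crawlers: keep the page out of the index, don't follow its links -->
    <meta name="robots" content="noindex, nofollow">

    <!-- Applies only to Google's crawler -->
    <meta name="googlebot" content="noindex">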

I. X-Robots Tag

In contrast to the meta robots tag, which lets you control indexing behaviour at the page level, the x-robots-tag HTTP header field lets you manage both the overall indexing of a page and the indexing of particular page elements.

Although the x-robots-tag supports the same indexation directives as the meta robots tag, it offers significantly more flexibility and functionality: it explicitly permits the use of regular expressions, lets you run crawl directives on non-HTML files, and allows you to set parameters globally.

The x-robots-tag can be used in the following situations (an example follows the list):

  • Controlling the indexation of non-HTML content (such as Flash files or video).
  • Blocking the indexation of a specific page element (such as an image or video) without blocking the entire page.
  • Managing indexation when you don't have access to a page's HTML, specifically its <head> section, for example when your website uses a global header that cannot be altered.
  • Adding criteria to determine whether a page should be indexed (e.g., index a user's profile page only if they have left more than 20 comments).
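
As a sketch, the header is set server-side. On an Apache server with mod_headers enabled, for example, the following keeps all PDF files out of the index:

    <Files ~ "\.pdf$">
        Header set X-Robots-Tag "noindex, nofollow"
    </Files>

A crawler then sees the directive as part of the HTTP response:

    HTTP/1.1 200 OK
    X-Robots-Tag: noindex, nofollow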

J. Canonical Tags

By scanning the HTML source code for canonical tags, Google and other search engines can determine which page is the canonical version.

When duplicates of a page exist, the canonical URL is the URL of the master copy of that page.

Since the duplicate pages live at different URLs, you can specify a canonical URL to tell Google which location holds the original, or most representative, version of the page.

In cases where a page appears twice at different URLs on your site, the canonical version is the most accurate one.
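
The canonical version is declared with a link element in the <head> of each duplicate (the URL below is a placeholder):

    <link rel="canonical" href="https://www.example.com/blue-widget">

Placing a self-referencing canonical tag on the master copy itself is also a common and harmless practice.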

K. HTTP Status Codes

Standard response codes provided by web servers on the internet are known as HTTP status codes, also known as browser/internet error codes. When a web page or other resource fails to load correctly, the codes assist in pinpointing the issue's root cause.

The HTTP status line, which consists of the HTTP status code and the HTTP reason phrase, is commonly referred to simply as the "HTTP status code."

Client Error 4xx.

This category contains requests for web pages or other resources that have poor syntax or cannot be fulfilled for some other reason, most likely due to the client's (the web surfer's) fault.

Server Error 5xx.

This category includes cases in which the website's server recognizes a request for a web page or other resource but cannot fulfil it for some reason.

In addition to 4xx and 5xx, there are other HTTP status codes. Additional codes like 1xx, 2xx, and 3xx indicate information, success, or a redirection, respectively. You shouldn't receive a browser warning for these other types because they are not errors.

L. Moved Permanently with a 301 status code

It's generally not a good idea to change the URLs of resources. However, there are times when changing a resource's URL is necessary and cannot be avoided.

Redirection means sending a request from one URL to another URL. As specified for status 301, the request method (as well as the request body) should not be changed when the redirection is followed.

HTTP status 301 is one of the redirection-related status codes; it denotes that the requested resource has been permanently moved to the URL specified in the Location header, and all future requests should use the new URL.
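
On the wire, a 301 response is just a status line plus a Location header; the browser or crawler then automatically requests the URL given in that header. For example:

    HTTP/1.1 301 Moved Permanently
    Location: https://www.example.com/new-page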

M. 302 Found and Temporarily Moved

When a web server serves a page, the HTTP status code shows what kind of response to expect. Like all status codes in the 300s, a 302 indicates a redirect: in this case, the user is temporarily sent from the URL they requested to another page.

If the requested page returns a 302 HTTP status code, it has moved temporarily. The user is redirected to the new location with a 302 "Found" (historically "Moved Temporarily") response, while the original URL continues to be used for requests.

These redirects are helpful when you want to steer users away from 404 page-not-found errors or from pages that aren't active now but will be again in the future. Online retailers might find a 302 temporary redirect useful when a product is out of stock or out of season.

E-commerce sites can use temporary redirects to block users from visiting these temporarily unavailable pages while still allowing them to be accessed once the products are available. You can apply and remove 302 Moved Temporarily redirects as needed because they are simpler to implement than 301 Moved Permanently redirects.

However, despite being simpler to implement, 302 redirects are typically a poor choice for SEO because search engine bots view the redirect as temporary. The ranking power of the original page is not passed on when pages are redirected with a 302 directive. 301 redirects are the better SEO option because they preserve most of that ranking power.

N. Not found, or Error 404

A well-designed 404 page can encourage visitors to look around your website more and reduce the likelihood that the lead is lost.

A 404 error, or Page Not Found, is a standard HTTP response code indicating that a webpage has been moved or deleted and that no redirect to a new URL has been set up.

Google announced that having 404 pages won't automatically lower your site's ranking.

However, 404 errors can still adversely affect organic search rankings.

PageRank cannot accumulate on pages that return a 404. If 100 of your pages link to blue_widget.html, complex calculations determine that page's internal value. After the page is deleted (and its URL starts returning a 404 error), those 100 pages are still linking to it, but the link juice is lost in a black hole: the page no longer gains or transmits PageRank.

Likewise, any link power passed to blue_widget.html from external websites is no longer distributed to your internal pages.

Even setting aside the previous two points, unintentional 404s can reduce your search traffic, because a 404 page cannot rank for the content that used to be there. If you have ten pages that each receive 100 visitors per day and delete just one of them, you'll see a traffic loss almost immediately.
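
One quick way to catch unintentional 404s is to run your known URLs through a small script. Below is a minimal sketch using Python's requests library; the URL list is hypothetical and would normally come from your sitemap or an analytics export:

    import requests

    urls = [
        "https://www.example.com/",
        "https://www.example.com/blue_widget.html",
    ]

    for url in urls:
        try:
            # A HEAD request is enough to read the status code without downloading the body
            response = requests.head(url, allow_redirects=True, timeout=10)
            if response.status_code == 404:
                print(f"404 Not Found: {url}")
        except requests.RequestException as exc:
            print(f"Request failed for {url}: {exc}")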

O. Soft 404s

A soft 404 has no official status code of its own; in fact, the server never sends a 404 response to the browser at all.

The term "soft 404" refers to a situation where the server displays a webpage and responds with a 200 OK status code, indicating success, even though the webpage or content is missing.

There are four common causes of a soft 404:

A server sends a 200 OK response even though a webpage is missing.

This soft 404 occurs when a page is missing but the server configuration directs users to the home page or a specific URL.

Although the page is no longer there, the publisher has taken action to satisfy the request for the missing page.

Content is "thin" or lacking."

When the content is either absent or barely present (a.k.a. thin content), the server still responds with a 200 status code, indicating that the request for the page was successful.

Search engines, however, may treat such pages as soft 404s and decline to index them.

A missing page is redirected to the homepage.

A 404 error response can lead some people to believe a problem exists.

Therefore, to avoid serving 404 error responses, a publisher may redirect the missing page to the homepage, even though the homepage is not what the visitor requested.

This type of 404 error is called a soft 404 by Google.

The missing page is redirected to a custom page.

Google labels missing pages that redirect to a specially designed page returning a 200 status code as soft 404s.

P. Crawl Errors

Your content won't rank if Google (or any other search engine, for that matter) can't crawl it.

Crawl errors are issues the Googlebot runs into while attempting to crawl the information on your website.

A crucial step when conducting a technical SEO site audit is checking the Index Coverage report on Google Search Console to see if your content has been crawled.

Two categories of crawl errors are distinguished:

  • Site errors indicate that Google could not crawl your entire website; you don't want these.
  • URL errors mean that Google could not crawl a particular URL on your website.

The set of pages and resources on your site that a search engine will crawl is called your "crawl budget." Because that budget is finite, remove low-quality or duplicate pages, block unnecessary resources, and make sure your most crucial pages are prioritized for crawling.

Here are some pointers to help you make the most of your crawl budget:

  • Canonicalize or remove duplicate content.
  • Fix or update any broken links.
  • Make sure that your CSS and JavaScript files can be crawled.
  • Regularly check your crawl statistics and watch for any unexpected drops or spikes.
  • Check that any bot or page you've blocked from crawling was actually meant to be blocked.
  • Be sure to submit your sitemap to the appropriate webmaster tools and keep it updated.
  • Remove any outdated or unnecessary content from your website.
  • Beware of dynamically generated URLs, which can dramatically increase the number of pages on your site.

Q. Page Speed Optimization

Page speed measures how quickly content on your page loads.

Site speed, which is often confused with page speed, is actually the page speed for a representative sample of page views on a site. Page speed can be measured as "page load time" (the time it takes for a page to fully display its content) or "time to first byte" (how long your browser has to wait before the web server sends it the first byte of information).

Google has stated that site speed (and, by extension, page speed) is one of the signals its algorithm uses to rank pages. Studies also suggest that when Google evaluates page speed, it may specifically be measuring time to first byte. Additionally, a slow page limits how many pages search engines can crawl within the available crawl budget, which hurts your indexation.

Page speed also affects user experience. Slower loading times are correlated with higher bounce rates and less average time on the page, and longer load times have also been shown to hurt conversions. See our case study to learn how we greatly increased a restaurant website's page speed by putting numerous strategies into practice, such as staggering photo load times, reducing photo sizes, and more.
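
If you want a rough time-to-first-byte reading outside of tools like PageSpeed Insights, you can approximate it with a few lines of Python. This is only a sketch: the elapsed value measured by requests covers the time from sending the request until the response headers are parsed, which is close to, but not exactly, TTFB.

    import requests

    url = "https://www.example.com/"  # placeholder URL
    # stream=True stops requests from downloading the whole body up front
    response = requests.get(url, stream=True, timeout=10)
    print(f"Status: {response.status_code}")
    print(f"Approximate time to first byte: {response.elapsed.total_seconds():.3f}s")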

Advanced Technical SEO Checklist

R. Implement structured data and schema markup to enhance your search engine listings and improve click-through rates.

Structured data is a method of associating a name with a value that helps search engines classify and index your content. Microdata, one type of structured data, is compatible with HTML5, and the Schema.org project provides a specific set of accepted definitions for microdata tags.

Numerous studies and debates have focused on whether structured data and schema markup have an impact on rankings. There is currently no solid proof that schema markup raises rankings. However, there are hints that search results with richer, more comprehensive snippets (such as those generated using schema) earn higher click-through rates.
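
Besides microdata, search engines also accept structured data as JSON-LD, which many sites find easier to maintain. A minimal Article example, using this post's own title and author purely as placeholder values, looks like this:

    <script type="application/ld+json">
    {
      "@context": "https://schema.org",
      "@type": "Article",
      "headline": "The Complete Technical SEO Checklist That Every Marketer Needs This 2023",
      "author": {
        "@type": "Person",
        "name": "Thomas Tay"
      },
      "datePublished": "2023-03-27"
    }
    </script>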

S. Use Google's Structured Data Testing Tool to validate your markup and ensure proper implementation.

Structured data is coded with in-page markup on the page where the information applies, and it should describe that page's visible content. Avoid adding structured data about information that is hidden from the user, and don't create blank or empty pages just to hold structured data, even if the markup itself is accurate.

Mobile SEO

Mobile SEO optimizes websites for search engines while ensuring they display properly on mobile devices like smartphones and tablets. Due to the growing popularity of portable devices, web admins need a mobile page strategy. After all, more than half of internet users regularly browse websites on mobile devices, and Google already gives preference to mobile-friendly websites.

According to Google, mobile-friendly websites typically function in one of three configurations:

T. Responsive web design.

With responsive web design, your site serves the same HTML code and content at the same URL regardless of the user's device. To tell the browser how to adjust the content, use the meta name="viewport" tag in the site's source code; the layout will then adapt to match each visitor's particular screen size.
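
The viewport tag goes in the page's <head>, and CSS media queries then adapt the layout. A minimal sketch (the .sidebar class is purely illustrative):

    <meta name="viewport" content="width=device-width, initial-scale=1">

    <style>
      /* Stack the sidebar below the main content on narrow screens */
      @media (max-width: 600px) {
        .sidebar { width: 100%; float: none; }
      }
    </style>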

U. Dynamic Serving.

With dynamic serving, the server responds at the same URL with different HTML and CSS depending on the user's device. For this to work, you need to use the Vary HTTP header to signal that the response changes based on the requesting user-agent. A valid header tells browsers and caches how the content may differ and helps Googlebot recognize your website's mobile-optimized content more quickly.
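
Concretely, pages served this way include a Vary header in the HTTP response:

    HTTP/1.1 200 OK
    Content-Type: text/html; charset=UTF-8
    Vary: User-Agent

On Nginx, for example, this can be added with the directive "add_header Vary User-Agent;", and other servers have equivalent settings.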

V. Different URLs.

As the name implies, this configuration uses separate URLs to serve your content successfully on different mobile devices, with each URL delivering HTML tailored to the appropriate screen size (see the annotation example after the list below). In general, you can anticipate the following when you choose one of the three mobile SEO configurations for your website:

  • More traffic to the website.
  • Enhanced user experiences.
  • Higher conversion rates.
  • Increased time spent on your website.
  • Fewer bounces.
  • Quicker page loads.
  • A competitive advantage in your market.
  • Increased customer interaction.
  • Enhanced search engine performance.
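
With separate URLs, Google recommends signalling the relationship between the desktop and mobile versions with annotations. A sketch, assuming a desktop page at www.example.com/page and a mobile version at m.example.com/page (both placeholders):

    On the desktop page:
    <link rel="alternate" media="only screen and (max-width: 640px)"
          href="https://m.example.com/page">

    On the mobile page:
    <link rel="canonical" href="https://www.example.com/page">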

W. Server Log File Analysis

Analysis of web server logs can reveal vital information about your web servers. To complete the process, log files produced by your web servers must be gathered, processed, and examined.

By doing this, you can collect detailed information about user or search engine bot requests made to your server, including any issues related to those requests. While some log monitoring can be done manually, an automated logging tool is your best bet for quick and in-depth analysis.

An organization may generate thousands of log entries every second, so performing search engine optimization (SEO) log analysis at scale is no small task. Understanding web server logs, access logs, and error logs is the first step in comprehending web server log analysis.

Every time a web server takes some action, an entry is made in the log. Web server logs therefore typically contain a history of webpage requests with extensive information on each request. This information may live in a single file or be divided into separate records, such as access logs and error logs. A web server log entry typically includes the request date and time, client IP address, requested page, bytes served, HTTP status code, referrer, and user agent.

If the collected data is divided into multiple logs, the access logs will record the time and origin of the server requests, the response codes, and the requested pages, while the error logs will contain details about any errors the server encountered while handling requests, along with other diagnostic information.
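
As a small illustration of what automated log analysis does, the following Python sketch parses an access log in the common "combined" format and counts the status codes returned to Googlebot. The log path and format are assumptions; adjust the pattern to match your server's configuration.

    import re
    from collections import Counter

    # Combined log format: IP, identity, user, [time], "request", status, bytes, "referrer", "user agent"
    LINE = re.compile(
        r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
        r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
        r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
    )

    status_counts = Counter()
    with open("access.log", encoding="utf-8", errors="replace") as log:
        for line in log:
            match = LINE.match(line)
            if match and "Googlebot" in match.group("agent"):
                status_counts[match.group("status")] += 1

    for status, count in status_counts.most_common():
        print(f"{status}: {count} Googlebot requests")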

X. HTTPS

HTTPS gives you a secure connection to the website you are visiting. You have probably seen this in action: look for the lock icon on the left side of the browser's address bar. When the lock is closed, the connection is secure. If it's open, or another icon or message appears, the connection is not secure and is open to attack. When you access a website through an insecure connection, hackers and other criminals may be able to access the information you send, including your password and email address.

When you enter a URL in the address bar, your browser looks up the website's IP address, for example 123.45.67.89. This number is the address the site has online, and the browser connects to it to find the right website. Over plain HTTP, everything is done in the open with no encryption, making it possible for anyone observing this traffic to intercept it. The login information you provide (username and password) is therefore sent in plain text when you access a website over an HTTP connection. That's bad, I assure you. Consider what would happen if you connected to your bank this way.

HTTPS protects this process. It encrypts the connection between the browser and the website to ensure no one can intercept the data sent between them. Every website that wants to secure itself needs a so-called SSL certificate; the browser examines the site's certificate and confirms its legitimacy with the organization that issued it. If you wish, click on the lock icon to reveal who issued the certificate. With HTTPS, sites secure your login process, your personal information, your activity on the site, and the pages you visit.

Y. JavaScript SEO

Technical SEO includes JavaScript SEO, which aims to make it simpler for search engines to crawl, render, and index websites created with JavaScript.

The following are typical tasks for JS SEO:

  • Optimizing content injected by JavaScript.
  • Implementing lazy loading correctly (see the snippet after this list).
  • Adopting best practices for internal linking.
  • Solving and preventing JavaScript problems.
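
For image lazy loading in particular, the simplest crawl-friendly approach is the native loading attribute, which keeps the image tag (and its src) in the initial HTML where crawlers can see it. The file name and dimensions below are placeholders:

    <!-- Below-the-fold image: the browser defers loading it, but crawlers still see the src -->
    <img src="/images/product-42.jpg" loading="lazy"
         alt="Blue widget, side view" width="800" height="600">

Patterns that only fill in the src attribute after the user scrolls are riskier, because crawlers may never trigger that scroll event.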

Z. Site Structure

Your website has many pages, and they must be organized so that any search engine can easily find and crawl them. That's where your website's information architecture, also known as the site's structure, comes into play.

Your site architecture determines how you arrange the pages on your website, much like how a building's architectural design informs its construction.

Pages related to one another are grouped; for instance, your blog homepage links to specific blog posts, and those individual blog posts each link to a different author page. Thanks to this structure, any search engine can better understand the relationships between your pages.

Your site's architecture should be influenced by how important each page is. Search engines will give page A more weight the closer it is to your homepage, the more pages link to it, and the more link equity those linking pages have.

For instance, a link to Page A from your homepage conveys more importance than a link from a blog post. The more links there are to Page A, the more "significant" that page is to search engines.

Other Important factors to consider in your technical SEO checklist

  • Use long tail keywords and relevant keywords
  • Orphan Pages
  • Content marketing
  • Fix broken links and remove duplicate content
  • Meta descriptions

Now that you know the basic technical SEO checklist, make sure your site can climb the Google rankings using Singapore SEO services.

The company that can give helpful SEO information to bring organic traffic to your website is Khepri Digital. Whether you're looking for a free tool for a site audit, off page SEO, on page SEO, or even technical SEO, they've got you covered!

How many pages can they help you with? One page, or even all of your important pages. From link building with internal and external links, keyword research, and website speed optimization to your robots.txt file, ensure your site can top the search rankings with Khepri.

ABOUT THE WRITER
Thomas Tay
SEO Head at Khepri Digital’s Singapore branch. An SEO nerd by day, he is responsible for the research & development to assess the impact and changes in Google algorithms. His work powers the processes behind Khepri Digital’s SEO manual. When not looking at SEO experiments, he tends to be around the Northern Area of Singapore, chomping on Satay and Chicken wings.