Table of Contents
- .htaccess Basics
- Error Handling
- Password Protection with .htaccess
- Enabling Server Side Includes (SSI)
- IP Blacklisting and IP Whitelisting
- Block Users by Referrer
- Blocking Bots and Web Scrapers
- Specifying a Default File for a Directory
- URL Redirects and URL Rewriting
- Hiding Your .htaccess File
- Enabling MIME types
- Block Hotlinking
- Disable or Enable Index
- Scripts as Source Code
- Configuring PHP Settings
- When Not to Use .htaccess
- Troubleshooting
- My .htaccess configuration
- .htaccess Protection For POST Abuse (WordPress)
This guide was built to serve as a comprehensive resource to using .htaccess.
.htaccess Basics
What is .htaccess
?
The .htaccess
file is a configuration file that affects how a web server responds to various requests. It is supported by several web servers, including the popular Apache software used by most web hosting providers.
.htaccess
files operate at the level of a directory, allowing them to override global configuration settings of .htaccess
directives higher in the directory tree.
Why is it called .htaccess
?
These files were first used to control user access on a per-directory basis. Using a subset of Apacheâs http.conf
settings directives, it allowed a system administrator to restrict access to individual directories to users with a name and password specified in an accompanying .htpasswd
file.
While .htaccess
files are still used for this, they are also used for a number of other things which weâll cover in this guide.
Where is the .htaccess
file?
In theory, every folder (directory) on your server could have one. Generally, though, there is one in your web root folder â thatâs the folder that holds all the content of your website, and is usually labeled something like public_html
, www
or httpdocs
.
If you have a single directory that contains multiple website subdirectories, there will usually be an .htaccess
file in the main root directory and also one in each subdirectory (/sitename
).
Why canât I find my .htaccess
file?
On most file systems, file names that begin with a dot ( .
) are hidden files. This means they are not typically visible by default.
But they arenât hard to get to. Your FTP client or File Manager should have a setting for âshow hidden filesâ. This will be in different places in different programs, but is usually in âPreferencesâ, âSettingsâ, or âFolder Optionsâ. Sometime youâll find it in the âViewâ menu.
What if I donât have an .htaccess
file?
First of all, make sure that you have turned on âshow hidden filesâ (or its equivalent), so that you can be sure you actually donât have one. Often, .htaccess
files are created automatically, so you will usually have one. But this isnât always the case.
If you really donât have one, you can easily create one:
- Start a new file in a plain text editor.
- Save it in
ASCII
format (notUTF-8
or anything else) as.htaccess
.- Make sure that it isnât
htaccess.txt
or something like that. The file should have only the name.htaccess
with no additional file extension.
- Make sure that it isnât
- Upload it to the appropriate directory via FTP or your browser-based file manager.
Error Handling
Using .htaccess
files to specify error documents is very simple, one of the simplest things you can do with this feature.
What is an error code?
When a request is made to a web server, it tries to respond to that request, usually by delivering a document (in the case of HTML pages), or by accessing an application and returning the output (in the case of Content Management Systems and other web apps).
If something goes wrong with this, an error is generated. Different types of errors have different error codes. You are probably familiar with the 404
error, which is returned if the document cannot be found on the server.
There are many other error codes that a server can respond with.
Client Request Errors
- 400 â Bad Request
- 401 â Authorization Required
- 402 â Payment Required (not used yet)
- 403 â Forbidden
- 404 â Not Found
- 405 â Method Not Allowed
- 406 â Not Acceptable (encoding)
- 407 â Proxy Authentication Required
- 408 â Request Timed Out
- 409 â Conflicting Request
- 410 â Gone
- 411 â Content Length Required
- 412 â Precondition Failed
- 413 â Request Entity Too Long
- 414 â Request URI Too Long
- 415 â Unsupported Media Type
Server Errors
- 500 â Internal Server Error
- 501 â Not Implemented
- 502 â Bad Gateway
- 503 â Service Unavailable
- 504 â Gateway Timeout
- 505 â HTTP Version Not Supported
What happens by default?
If you donât specify any type of error handling, the server will simply return the message to the browser, and the browser will display a generic error message to the user. This is usually not ideal.
Specifying Error Documents
Create an HTML document for each error code you want to handle. You can name these whatever you like, but itâs helpful to name them something that will help you remember what theyâre for, like not-found.html
or simply 404.html
.
Then, in the .htaccess
file, specify which document to use with each type of error.
ErrorDocument 400 /errors/bad-request.html
ErrorDocument 401 /errors/auth-reqd.html
ErrorDocument 403 /errors/forbid.html
ErrorDocument 404 /errors/not-found.html
ErrorDocument 500 /errors/server-err.html
Notice that each directive is placed on its own line.
And thatâs it. Very simple.
Alternatives to .htaccess
for error handling
Most content management systems (CMS) like WordPress, and most web apps, will have their own way of handling most of these errors codes.
Password Protection with .htaccess
The original purpose of .htaccess
files was to restrict access to certain directories on a per-user basis (hence the name, hypertext access). So weâll look at that first.
.htpasswd
Usernames and passwords for the .htaccess
system are stored in a file name .htpasswd
.
These are stored each on a single line, in the form:
username:encryptedpassword
for example:
johnsmith:F418zSM0k6tGI
Itâs important to realize that the password stored in the file isnât the actual password used to log in. Rather it is a cryptographic hash of the password.
This means that the password has been run through an encryption algorithm, and the result is stored. When a user logs in, the plain-text password is entered and run through the same algorithm. If the input is the same, the passwords match and the user is granted access.
Storing passwords this way makes them more secure â if someone were to gain access to your .htpasswd
file, they would only see the hashed passwords, not the originals. And there is no way to reconstruct the originals from the hash â it is a one way encryption.
Several different hashing algorithms can be used:
- Secure Algorithms â Use one of these
- bcrypt â This is the most secure, but also the slowest to compute. It is supported by Apache and Nginx.
- md5 â This is the default hashing algorithm used by current versions of Apache. It is not supported by Nginx.
- Insecure Algorithms â Do not use these
- crypt() â This used to be the default hashing function, but it is not very secure.
- SHA and Salted SHA
Creating usernames and passwords on the command line
You can create an .htpasswd
file, and add username-password pairs to it, directly from the command line or SSH terminal.
The command for dealing with the .htpasswd
file is simply htpasswd
.
To create a new .htpasswd
file, use the command with the -c
option (for create), then type the path to the directory (not the URL, the actual path on the server). You can also include a user you want to add.
> htpasswd -c /usr/local/etc/.htpasswd johnsmith
This creates a new .htpasswd
file in the /etc/
directory, and adds a record for a user named johnsmith
. You will be prompted for a password, which will also be stored, using the md5 encryption.
If there is already an .htpasswd
file at the specified location, a new one is not created â the new user is simply appended to the existing file.
If youâd prefer to use the bcrypt hashing algorithm, use the -b
option.
Password hashing without the command line
If you donât feel comfortable using the command line or SSH terminal (or if you donât have access to it for some reason), you can simply create an .htpasswd
file and populate it using a plain text editor, and upload it via FTP or file manager.
But then youâll need to encrypt your passwords somehow, since the htpasswd
command was taking care of that for you.
There are many .htpasswd
encryption utilities available online.
This gives you several options for hashing algorithm and password strength. You can simply copy-and-paste the output from there into your .htpasswd
file.
Where to keep your .htpasswd
file
You donât need to have a separate .htpasswd
file for every .htaccess
file. In fact, you shouldnât. Under most normal circumstances, you should have one for your entire web hosting account or main server directory.
The .htpasswd
file should not be in a publicly accessible directory â not Public_HTML
or www
or any subdirectory. It should be above those, in a folder that is only accessible from the server itself.
How to use .htpasswd
with .htaccess
Each directory can have its own .htaccess
file, with its own set of users which are allowed to access it.
If you want anyone (including non-logged-in users) to access the directory and its files, simply do nothing â that is the default.
To restrict access you need to add the following to the .htaccess
file:
AuthUserFile /usr/local/etc/.htpasswd
AuthName "Name of Secure Area"
AuthType Basic
<Limit GET POST>
require valid-user
</Limit>
The first line specifies the path and file name to your list of usernames and passwords. The second line specifies a name for the secured area. This can be anything you like. The third line specifies âBasicâ authentication, which is what you usually need.
The <Limit>
tag specifies what is being limited (in this case, the ability to GET or POST to any file in the directory). Within the pair of <Limit>
tags is a list of who is allowed to access files.
In the above example, any valid user can access files. If you want to restrict access to a specific user or few users, you can name them.
AuthUserFile /usr/local/etc/.htpasswd
AuthName "Name of Secure Area"
AuthType Basic
<Limit GET POST>
require user johnsmith
require user janedoe
</Limit>
You can also put users into groups and allow access based on group. This is done by adding another file which specifies the groups.
The group file, which could be named (for example) .htgroups
looks like this:
admin: johnsmith janedoe
staff: jackdoe cindysmith
Then you can specify it in your .htaccess
file:
AuthUserFile /usr/local/etc/.htpasswd
AuthGroupFile /usr/local/etc/.htgroup
AuthName "Admin Area"
AuthType Basic
<Limit GET POST>
require group admin
</Limit>
Alternatives to .htpasswd
Using .htaccess
and .htpasswd
to resrict access to certain files on your server only really makes sense if you have a lot of static files. The feature was developed when web sites were usually a collection of HTML documents and related resources.
If you are using a content management system (CMS) like WordPress or Drupal, you can use the built-in user management features to restrict or grant access to content.
Enabling Server Side Includes (SSI)
What are Server Side Includes?
SSI, or Server Side Includes, is a light-weight scripting language used primarily to embed HTML documents into other HTML documents.
This makes it easy to re-use common elements, such as headers, footers, sidebars, and menus. You can think of it as a precursor to todayâs templating and content management systems.
<!-- include virtual="header.shtml" -->
SSI also has conditional directives (if
, else
, etc.) and variables, making it a complete, if somewhat hard to use, scripting language. (Typically, any project more complicated than a handful of includes will cause a developer to choose a more robust language like PHP or Perl.)
Enabling SSI
Some web hosting servers will have Server Side Includes enabled by default. If not, you can enable it with your .htaccess
file, like so:
AddType text/html .shtml
AddHandler server-parsed .shtml
Options Indexes FollowSymLinks Includes
This should enable SSI for all files that have the .shtml
extension.
SSI on .html
files
If you want to enable SSI parsing on .html
files, you can add a directive to accomplish that:
AddHandler server-parsed .html
The benefit of doing this is that you can use SSI without letting the world know you are using it. Also, if you change implementations in the future, you can keep your .html
file extensions.
The downside of this is that every .html
file will be parsed with SSI. If you have a lot of .html
files that donât actually need any SSI parsing, this can introduce a lot of unneeded server overhead, slowing down your page load times and using up CPU resources.
SSI on your Index page
If you donât want to parse all .html
files, but you do want to use SSI on your index (home) page, youâll need to specify that in your .htaccess
file.
Thatâs because when the web server is looking for a directoryâs index page, it looks for index.html
, unless you tell it otherwise. If you arenât parsing .html
files, youâll need your index page to be named index.shtml
for SSI to work, and your server doesnât know to look for that by default.
To enable that, simply add:
DirectoryIndex index.shtml index.html
This alerts the web server that the index.shtml
file is the main index file for the directory. The second parameter, index.html
is a backup, in case index.shtml
canât be found.
IP Blacklisting and IP Whitelisting
You can use .htaccess
to block users from a specific IP address (blacklisting). This is useful if you have identified individual users from specific IP addresses which have caused problems.
You can also do the reverse, blocking everyone except visitors from a specific IP address (whitelisting). This is useful if you need to restrict access to only approved users.
Blacklisting by IP
To block specific IP addresses, simply use the following directive, with the appropriate IP addresses:
order allow,deny
deny from 111.22.3.4
deny from 789.56.4.
allow from all
The first line states that the allow
directives will be evaluated first, before the deny
directives. This means that allow from all
will be the default state, and then only those matching the deny
directives will be denied. If this was reversed to order deny,allow
, then the last thing evaluated would be the allow from all
directive, which would allow everybody, overriding the deny
statements.
Notice the third line, which has deny from 789.56.4.
â that is not a complete IP address. This will deny all IP addresses within that block (any that begin with 789.56.4
).
You can include as many IP addresses as you like, one on each line, with a deny from
directive.
Whitelisting by IP
The reverse of blacklisting is whitelisting â restricting everyone except those you specify.
As you may guess, the order
directive has to be reversed, so that that everyone is first denied
, but then certain addresses are allowed
.
order deny,allow
deny from all
allow from 111.22.3.4
allow from 789.56.4.
Domain names instead of IP addresses
You can also block or allow users based on a domain name. This can be help block people even as they move from IP address to IP address. However, this will not work against people who can control their reverse-DNS IP address mapping.
order allow,deny
deny from example.com
allow from all
This works for subdomains, as well â in the previous example, visitors from xyz.example.com
will also be blocked.
Block Users by Referrer
A referrer is the website that contains a link to your site. When someone follows a link to a page on your site, the site they came from is the referrer.
This doesnât just work for clickable hyperlinks to your website, though. Pages anywhere on the internet can link directly to your images (âhotlinkingâ) â using your bandwidth, and possibly infringing on your copyright, without providing any benefit to you in terms of traffic. They can also hotlink to your CSS files, JS scripts, or other resources.
Most website owners are okay with this when happens just a little bit, but sometimes this sort of thing can turn into abuse.
Additionally, sometimes actual in-text clickable hyperlinks are problematic, such as when they come from hostile websites.
For any of these reasons, you might want to block requests that come from specific referrers.
To do this, you need the mod_rewrite
module enabled. This is enabled by default for most web hosts, but if it isnât (or you arenât sure) you can usually just ask your hosting company. (If they canât or wonât enable it, you might want to think about a new host.)
The .htaccess
directives that accomplish referrer-based blocking rely on the mod_rewrite
engine.
The code to block by referrer looks like this:
RewriteEngine on
RewriteCond %{HTTP_REFERER} ^https://.*example\.com [NC,OR]
RewriteCond %{HTTP_REFERER} ^https://.*anotherexample\.com [NC,OR]
RewriteCond %{HTTP_REFERER} ^https://.*onemoreexample\.com [NC]
RewriteRule .* - [F]
This is a little tricky, so lets walk through it.
The first line, RewriteEngine on
, alerts the parser that a series of directives related to rewrite is coming.
The next three lines each block one referring domain. The part you would need to change for your own use is the domain name (example
) and extension (.com
).
The backward-slash before the .com
is an escape character. The pattern matching used in the domain name is a regular expression, and the dot means something in RegEx, so it has to be âescapedâ using the back-slash.
The NC
in the brackets specifies that the match should not be case sensitive. The OR
is a literal âorâ, and means that there are other rules coming. (That is â if the URL is this one or this one or this one, follow this rewrite rule.)
The last line is the actual rewrite rule. The [F] means âForbidden.â Any requests with a referrer matching the ones in the list will fail, and deliver a 403 Forbidden
error.
Blocking Bots and Web Scrapers
One of the more annoying aspects of managing a website is discovering that your bandwidth is being eaten up by non-human visitors â bots, crawlers, web scrapers. These are programs that are designed to pull information out of your site, usually for the purpose of republishing it as part of some low-grade SEO operation.
There, of course, legitimate bots â like those from major search engines. But the rest are like pests that just eat away at your resources and deliver no value to you whatsoever.
There are several hundred bots identified. You will never be able to block all of them, but you can keep the activity down to a dull roar by blocking as many as you can.
Here is a good security solution for WordPress websites.
Specifying a Default File for a Directory
When a request is made to a web server for a URL which does not specify a file name, the assumption built into most web servers is that the URL refers to a directory.
So, if you request https://example.com
, Apache (and most other web servers) is going look in the root directory for the domain (usually /Public_HTML
or something similar, but perhaps /example-com
) for the default file.
The default file, by default, is called index.html
. This goes way back to the beginning of the internet when a website was just a collection of documents, and the âhomeâ page was usually an index of those documents.
But you might not want index.html
to be the default page. For example, you might need a different file type, like index.shtml
, index.xml
, or index.php
.
Or you might not think of your home page as an âindexâ, and want to call it something different, like home.html
or main.html
.
Setting the Default Directory Page
.htaccess
allows you to set the default page for a directory easily:
DirectoryIndex [filename here]
If you want your default to be home.html
itâs as easy as:
DirectoryIndex home.html
Setting Multiple Default Pages
You can also specify more than one DirectoryIndex
:
DirectoryIndex index.php index.shtml index.html
The way this works is that the web server looks for the first one first. If it canât find that, it looks for the second one, and so on.
Why would you want to do this? Surely you know which file you want to use as your default page, right?
Remember that .htaccess
affects its own directory, and every subdirectory until it is overridden by a more local file. This means that an .htaccess
file in your root directory can provide instructions for many subdirectories, and each one might have its own default page name. Being able to place those rules in a single .htaccess
file in the root means that you donât have to duplicate all the other directives in the file at every directory level.
URL Redirects and URL Rewriting
One of the most common uses of .htaccess
files is URL redirects.
URL redirects should be used when the URL for a document or resource has changed. This is especially helpful if you have reorganized your website or changed domain names.
301 vs. 302
From a browser standpoint, there are two types of redirects, 301 and 302. (These numbers refer to the error code generated by the web server.)
301 means âPermanently Moved,â while 302 means âMoved Temporarily.â In most cases, you want to use 301. This preserves any SEO equity the original URL had, passing it on to the new page. It also will cause most browsers to update their bookmarks. Most browsers will also cache the old-to-new mapping, so they will simply request the new URL when a link or user attempts to access the original. If the URL has changed permanently, these are all desirable results.
Thereâs very little reason to use 302 redirects, since thereâs usually very little reason to temporarily change a URL. Changing a URL ever is undesirable, but is sometimes necessary. Changing it temporarily, with the plan to change it back later, is a bad idea and is almost always avoidable.
All the examples in this section will use the 301 redirect.
Redirect vs. Rewrite
There are two different ways to âchangeâ a URL with .htaccess
directives â the Redirect
command and the mod_rewrite
engine.
The Redirect
command actually sends a redirect message to the browser, telling it what other URL to look for.
Typically, the mod_rewrite
tool âtranslatesâ one URL (the one provided in a request) into something that the file system or CMS will understand, and then handles the request as if the translated URL was the requested URL. When used this way, the web browser doesnât notice that anything happened â it just receives the content it asked for.
The mod_rewrite
tool can also be used to produce 301 redirects that work the same way as the Redirect
command, but with more options for rules â mod_rewrite
can have complex pattern matching and rewriting instructions, which Redirect
cannot take advantage of.
Basic Page Redirect
To redirect one page to another URL, the code is:
Redirect 301 /relative-url.html https://example.com/full-url.html
This single-line command has four parts, each separated with a single space:
- The
Redirect
command - The type of redirect (
301 - Moved Permanently
) - The relative URL of the original page
- The full and complete URL of the new page
The relative URL is relative to the directory containing the .htaccess
file, which is usually the web root, or the root of the domain.
So if https://example.com/blog.php
had been moved to https://blog.example.com
, the code would be:
Redirect 301 /blog.php https://blog.example.com
Redirecting a large section
If you have moved your directory structure around, but kept your page names the same, you might want to redirect all requests for a certain directory to the new one.
Redirect 301 /old-directory https://example.com/new-directory
Redirecting an entire site
What if you entire site has moved to a new URL? Easy.
Redirect 301 / https://newurl.com
Redirecting www to non-www
Increasingly, websites are moving away from the www
subdomain. Itâs never really been necessary, but it was a holdover from the days when you most people who operated a website were using a server to store lots of their own documents, and the www
or âworld wide webâ directory was used for content they wanted to share with others.
These days, some people use it, and some people donât. Unfortunately, some users still automatically type www.
in front of every URL out of habit. If youâre not using www
, you want to make sure that these requests land in the right place.
To do this, youâll need to use the mod_rewrite
module, which is probably already installed on your web host.
Options +FollowSymlinks
RewriteEngine on
RewriteCond %{http_host} ^www\.example\.com [NC]
RewriteRule ^(.*)$ https://example.org/$1 [R=301,NC]
Be careful!
A lot of other .htaccess
and mod_rewrite
guides offer some variation of the following code to accomplish this:
Options +FollowSymlinks
RewriteEngine on
RewriteCond %{http_host} !^example\.com [NC]
RewriteRule ^(.*)$ https://example.org/$1 [R=301,NC]
Do you see the problem with that?
It redirects all subdomains to the primary domain. So not just www.example.com
, but also blog.example.com
and admin.example.com
and anything else. This is probably not the behavior you want.
Redirecting to www
But what if you are using the www
subdomain?
You should probably set up a redirect to make sure people get to where theyâre trying to go. Especially now that fewer people are likely to automatically add that www
to the beginning of URLs.
You just reverse the above code.
RewriteEngine On
RewriteCond %{http_host} ^example.com [NC
RewriteRule ^(.*) https://example.com/$1 [R=301,NC]
Something you shouldnât do
Several guides on .htaccess
redirects include instructions on how to make 404 errors redirect to the home page.
This is a good example of how just because you can do something, it doesnât mean you should do something.
Redirecting 404 errors to the siteâs homepage is a terrible idea. It confuses visitors, who canât figure out why they are seeing the front page of a site instead of a proper 404 error page.
All websites should have a decent 404 page which clearly explains to the user that the content couldnât be found and, ideally, offers some search features to help the user find what they were looking for.
Why use .htaccess
instead of other alternatives?
You can set up redirect in PHP files, or with any other type of server-side scripting. You can also set them up within your Content Management System (which is basically the same thing).
But using .htaccess
is usually the fastest type of redirect. With PHP-based redirects, or other server-side scripting languages, the entire request must be completed, and the script actually interpreted before a redirect message is sent to the browser. With .htaccess
redirects, the server responds directly to the request with the redirect message. This is much faster.
You should note, though â some content management systems actually manage redirects by updating the .htaccess
programatically. WordPress, for example, has redirect plugins that work this way. (And WPâs pretty URL system does this as well.) This gives you the performance of using .htaccess
directly, while also giving you the convenience of management from within your application.
Hiding Your .htaccess File
There is no reason that someone should be able to view your .htaccess
file from the web. It is never a needed document.
Moreover, there are some big reasons you should definitely not want people to see your .htaccess
file. The biggest issue is that if you are using an .htpasswd
file, its location is spelled out in the .htaccess
file. Knowing where to find it makes it easier to find.
Moreover, as a general rule, you donât want to provide the public with details about your implementation. Rewrite rules, directory settings, security â all of the things that you use .htaccess
for â it is a good security practice to hide all of this behind-the-scenes at your web server. The more a hacker can learn about your system, the easier it is to compromise it.
It is very easy to hide your .htaccess
file from public view. Just add the following code:
<Files .htaccess>
order allow,deny
deny from all
</Files>
Enabling MIME types
MIME types are file types. Theyâre called MIME types because of their original association with email (MIME stands for âMultipurpose Internet Mail Extensionsâ). They arenât just called âfile typesâ because MIME implies a specific format for specifying the file type.
If youâve ever authored an HTML document, youâve like specified a MIME type, even if you didnât know it:
<style type="text/css" src="/style.css" />
The type
attribute refers to a specific MIME type.
MIME types on your server
Sometimes youâll find that your web server isnât configured to deliver a particular type of file. It just doesnât work â requests for the file simply fail.
In most cases, you can fix this problem by adding the MIME type to your .htaccess
file.
AddType text/richtext rtx
This directive has three parts, each separated by a space:
- The
AddType
comman - The MIME type
- The file extension
If you want to associate several different file extensions with the same MIME type, you can do that on a single line.
AddType image/jpeg jpeg jpg jpe JPG
Force Download by MIME Type
If you want all links to specific file types to launch as downloads, instead of being opened in the browser, you do that with the MIME type application/octet-stream
, like this:
AddType application/octet-stream pdf
Again, you can specify multiple file extensions with a single type:
AddType application/octet-stream pdf doc docx rtf
List of File Extensions and MIME Types
Here is a not-quite-complete list of file formats and associated MIME types.
If you are managing your own website, and you know what file types you publish resources in, then there is no need to paste this entire list into your .htaccess
file.
However, if you run a site that many other people are contributing and publishing content to, you may want to simply allow a large number of file types this way to make sure no one has a bad experience. This is especially the case if you run a site where people might be specifically sharing a lot of files, for example a file sharing site, a project management application (where many files will often be attached to project), or a web app that handles email.
AddType application/macbinhex-40 hqx
AddType application/netalive net
AddType application/netalivelink nel
AddType application/octet-stream bin exe
AddType application/oda oda
AddType application/pdf pdf
AddType application/postscript ai eps ps
AddType application/rtf rtf
AddType application/x-bcpio bcpio
AddType application/x-cpio cpio
AddType application/x-csh csh
AddType application/x-director dcr
AddType application/x-director dir
AddType application/x-director dxr
AddType application/x-dvi dvi
AddType application/x-gtar gtar
AddType application/x-hdf hdf
AddType application/x-httpd-cgi cgi
AddType application/x-latex latex
AddType application/x-mif mif
AddType application/x-netcdf nc cdf
AddType application/x-onlive sds
AddType application/x-sh sh
AddType application/x-shar shar
AddType application/x-sv4cpio sv4cpio
AddType application/x-sv4crc sv4crc
AddType application/x-tar tar
AddType application/x-tcl tcl
AddType application/x-tex tex
AddType application/x-texinfo texinfo texi
AddType application/x-troff t tr roff
AddType application/x-troff-man man
AddType application/x-troff-me me
AddType application/x-troff-ms ms
AddType application/x-ustar ustar
AddType application/x-wais-source src
AddType application/zip zip
AddType audio/basic au snd
AddType audio/x-aiff aif aiff aifc
AddType audio/x-midi mid
AddType audio/x-pn-realaudio ram
AddType audio/x-wav wav
AddType image/gif gif GIF
AddType image/ief ief
AddType image/jpeg jpeg jpg jpe JPG
AddType image/tiff tiff tif
AddType image/x-cmu-raster ras
AddType image/x-portable-anymap pnm
AddType image/x-portable-bitmap pbm
AddType image/x-portable-graymap pgm
AddType image/x-portable-pixmap ppm
AddType image/x-rgb rgb
AddType image/x-xbitmap xbm
AddType image/x-xpixmap xpm
AddType image/x-xwindowdump xwd
AddType text/html html htm
AddType text/plain txt
AddType text/richtext rtx
AddType text/tab-separated-values tsv
AddType text/x-server-parsed-html shtml sht
AddType text/x-setext etx
AddType video/mpeg mpeg mpg mpe
AddType video/quicktime qt mov
AddType video/x-msvideo avi
AddType video/x-sgi-movie movie
AddType x-world/x-vrml wrl
Block Hotlinking
Hotlinking is the practice of linking to resources from other domains instead of uploading the content to your own server and serving it yourself.
Say you find an image on a website that you really like, and you want to use it on your site. Ignoring copyright issues for the moment â you could download the image, upload it to your website, and embed it on your page like normal.
<img src="https://yourdomain.com/image.jpg">
But if you were lazy, or trying to save bandwidth, or didnât know how to upload a file, you could just embed it directly form the original file.
<img src="https://originaldomain.com/image.jpg">
Thatâs hotlinking. It also happens with CSS and JS files, but images are the most common.
Some websites/hosts donât mind at all if you do this â you can hotlink images from Wikipedia without anyone being upset. And some websites encourage it in one form or another. For example, JQuery provides their JS libraries via a CDN (Content Delivery Network), so you can hotlink directly to it without having to upload it and serve it from your own server.
But many web host consider hotlinking to be a form of bandwidth and resource stealing. To be sure, if you are running a relatively small site, you canât afford to have thousands, or tens of thousands, of requests being made every day for resources that have nothing to do with actual visitors to your site.
If you are having a problem with hotlinking, you can disable it with some mod_rewrite
rules added to your .htaccess
file.
RewriteEngine on RewriteCond %{HTTP_REFERER} !^$ RewriteCond %{HTTP_REFERER} !^https://(www\.)?example.com/.*$ [NC] RewriteRule \.(gif|jpg|jpeg|png|js|css)$ - [F]
Be sure to change example.com
in the third line to your actual domain name. This will catch any requests not coming from your domain, and then check if it matches one of the specified file extensions in the fourth line. If there is a match, the request fails.
If you want to add other file extensions, you can simply edit the last line.
Serving up Alternative Content
If you want to let the world know why their hotlinking has suddenly stopped working, you can replace hotlinked images with a special image with a message like, âWe hate hotlinking!â or âOriginal Content Available at https://example.comâ.
Instead of failing the request, you simply redirect it to the âspecialâ image:
RewriteEngine on RewriteCond %{HTTP_REFERER} !^$ RewriteCond %{HTTP_REFERER} !^https://(www\.)?example.com/.*$ [NC] RewriteRule \.(gif|jpg)$ https://www.example.com/no-hotlinking.jpg [R,L]
Disable or Enable Index
What happens if you have a directory full of documents or other resources, no index.html
file, and no default directory page specified in the .htaccess
file?
In many cases, the result will be a generic directory listing of all the files in the directory.
Thatâs right. If you have a folder in your hosting directory labeled /images
, and it has no index.html
page, when someone navigates to https://example.com/images
, they will be able to see a list of all the images on your site.
Thatâs the default behavior of most web servers, and it makes sense from the standpoint of the original conception of a website as simply a place to keep and share documents. But this is not the desired behavior for most sites.
Disabling Indexes
Many web hosting accounts will have disable this already as part of their global configuration. But not all do so.
If you need to disable automatically generated directory listings, doing so is easy:
Options -Indexes
Enabling Indexes
If your web server has disabled indexes as part of global configuration, but you do want them, you can enable them with the reverse of the above command.
Options +Indexes
Hiding some files from the Index
If you want to show directory listings, but you want to hide certain file types from the list, you can do that too.
IndexIgnore *.gif *.jpg
The *
is a wild-card character. The above directive will hide all files that have a .gif
or .jpg
extension. If you wanted to be more specific, you could:
IndexIgnore secret-image.jpg
Enabling CGI Everywhere
CGI, or Common Gateway Interface, is server-side method for including non-HTML scripts (like Perl or SSI) in web pages.
Typically, CGI scripts are stored in a folder labeled /cgi-bin
. The webserver is configured to treat any resource in that directory as a script, rather than a page.
The problem with that is two-fold: â URLs referencing CGI resources need to have /cgi-bin/
in them, which places implementation details into your URL â an anti-pattern to be avoided for a number of reasons. â a complex website may need a better organization structure than simply having a ton of scripts jammed into a single /cgi-bin
folder.
If you want your web server to parse CGI scripts no matter where they are found in your directory structure, just add the following to your .htaccess
file:
AddHandler cgi-script .cgi
Options +ExecCGI
If you have other file extensions you want processed as CGI scripts, you can add them in the first line.
Scripts as Source Code
Most of time, you put scripts in your web directory because, well, you want them to be run as scripts.
But sometimes that isnât what you want. Sometimes you want to display the source code to public visitors, instead of running the script.
This might be the case if you run a file sharing service or a code repository site, and you want people to see the source code and be able to download it, but the scripts are actually part of your siteâs functionality.
This can be done in your .htaccess
file by removing the script handler for certain file types and replacing it with a handler for text.
RemoveHandler cgi-script .pl .cgi .php .py
AddType text/plain .pl .cgi .php .py
Alternatively, as mention previously, you could force files with these extensions to be downloaded automatically, rather than displayed.
RemoveHandler cgi-script .pl .cgi .php .py
AddType application/octet-stream .pl .cgi .php .py
Be careful with either of these, though. If you only want some files to be displayed this way, but are still using these scripts for the rest of your website, your going to have a bad time if you put that directive into your web rootâs .htaccess
file.
A better practice would be to place all such âdisplay onlyâ scripts into a single directory, and then place the directive into an .htaccess
file there in that folder.
Configuring PHP Settings
Sometimes you need to tweak PHPâs settings. The right way to do this is in a file called php.ini
.
Unfortunately, not all web hosting companies allow their customers to edit the php.ini
file. This is especially true of shared hosting providers, where a single installation of PHP may be running hundreds of web sites.
Fortunately, thereâs a workaround â you can embed php.ini
rules into your .htaccess
file.
The syntax looks like:
php_value [setting name] [value]
So, for example, if you need to increase the max file upload size (a common issue), it is as easy as:
php_value upload_max_filesize 10M
Not all PHP settings can be specified in .htaccess
files. For example you can not disable_classes
this way.
For a complete list of all php.ini
settings, see the official php.ini directives guide.
When Not to Use .htaccess
Editing your .htaccess
file for the first time can give you sudden feeling of immense power over your web hosting environment. You suddenly feel like a sysadmin.
Unfortunately, this power can go to your head, and you may find yourself using the .htaccess
file in ways that arenât really the best.
When you need to do something that seems like an .htaccess
sort of job, thereâs basically two situations where you should put that directive somewhere else.
Further Upstream
Whenever possible, the types of directives you can place in an .htaccess
file are better off being place in the httpd.conf
file, which is a configuration settings file for the entire server.
Similarly, PHP settings more properly belong in the php.ini
file, and most other languages have similar configuration setting files.
Placing directives further upstream, in the httpd.conf
, php.ini
, or other language-specific configuration file allows those settings to be âbaked-inâ to the web serverâs parsing engine. With .htaccess
, the directives have to be checked and interpreted with every single request.
If you have a low traffic site with only a handful of .htaccess
directives, this isnât a big deal. But if you have a lot of traffic, and a lot of directives, the performance lag can really add up.
Unfortunately, many shared hosting providers do not allow customers to access the httpd.conf
or php.ini
files, forcing users to rely on the slower .htaccess
file. This provides a double-penalty when compared to custom VPS configurations because shared hosting is also generally low-powered. This is one of the reasons that a site with respectable traffic should probably be on a VPS plan instead of shared hosting plan.
Further Downstream
If you are using a good Content Management System (CMS) such as WordPress or Drupal, some of the things you might do in an .htaccess
file â such as redirect URLs or block IP addresses â can be done from inside the application.
Often, this works in conjunction with the .htaccess
file, with the application programmatically adding directives.
When this is available, it is usually best to accomplish these tasks from inside the application, rather than editing the .htaccess
file yourself. You are less likely to introduce bugs and incompatible directives if you use a well-tested, open source plugin, rather than editing it yourself.
Troubleshooting
Messing around with your .htaccess
file can be great â but it can also cause your server to seize up and start delivering 500 Internal Server Error
messages.
Hereâs a few ideas to help you through that.
Do one thing at a time
This should go without saying, but â sadly â itâs a lesson many of us have to learn over and over again.
Do one thing. Then test it. Then do another thing. Test that.
If you do several things all at once, and then something fails, you wonât know which directive is causing the problem.
Backup your file before each thing
Along with doing only one thing at a time, you should save your file between each thing you are trying. Your saved archive needs to be restorable. This isnât Microsoft Word where you can just Undo â you need a saved copy of your file.
You should always have the latest working version available in case you mess something up. Always, always, always have the ability to restore to a working version.
This is easiest if you some kind of source management system like Git. You can commit after each change, and roll back if you run into any problems.
Check the error logs
If you do run into a problem, and youâre having a hard time figuring out why, check your Apache error logs. These often provide valuable information about where to look.
Ask around
Developer forums and Q&A sites like StackOverflow are invaluable tools for even the most seasoned developers and sysadmins. And donât forget Google. Often, the difference between a bad web master and great one isnât knowing the answer, its knowing where to find the answer.
Common .htaccess
problems
Sometimes you made a typo. Sometimes you have an esoteric and confusing problem caused by a confluence of unpredictable factors.
Most problems, and the really frustrating ones, are the ones in the middle â the simple, everyday problems that are easy to fix if you just knew about them.
Hereâs a few of those.
Bad Filename
There is only one way to spell .htaccess
â it has to begin with the dot, and it must be in all lowercase letters.
It seems dumb, but if your .htaccess
file isnât doing what you expect, that should be the first thing you check.
.htaccess Disabled, or Partly Disabled
Some shared hosting providers disable .htaccess
altogether. Others allow it, but restrict certain directives from being used â theyâre just ignored if included.
Similarly, even on VPS plans or your own dedicated servers, .htaccess
might be disabled.
If you have access to the httpd.conf
file, or other server settings, you can check this yourself. If you find the directive AllowOverride None
, you found the culprit. Replace it with AllowOverride All
.
If you donât have access to your httpd.conf
file (because youâre on shared hosting, for example), you may need to contact your hosting companyâs tech support and see if they can enable it for you, or offer you suggestions on ow to accomplish what youâre trying to do in a different way.
Conflicting or Overridden Directives
If you have multiple nested directories, itâs possible for each one to have its own .htaccess
file. Every .htaccess
file from the root, through each nested directory, applies â they are read in order, descending down the directory tree.
If you set something in your root directory, and then something in subdirectory overrides it, the directive in the .htaccess
file closest to the requested file will take precedence.
My .htaccess
configuration
Hereâs my .htaccess
configuration:
AddDefaultCharset utf-8
DefaultLanguage en-US
# Add Proper MIME-Type for Favicon
AddType image/x-icon .ico
<IfModule mod_deflate.c>
# Compress HTML, CSS, JavaScript, text, XML and fonts
AddOutputFilterByType DEFLATE application/javascript
AddOutputFilterByType DEFLATE application/rss+xml
AddOutputFilterByType DEFLATE application/vnd.ms-fontobject
AddOutputFilterByType DEFLATE application/x-font
AddOutputFilterByType DEFLATE application/x-font-opentype
AddOutputFilterByType DEFLATE application/x-font-otf
AddOutputFilterByType DEFLATE application/x-font-woff
AddOutputFilterByType DEFLATE application/x-font-truetype
AddOutputFilterByType DEFLATE application/x-font-ttf
AddOutputFilterByType DEFLATE application/x-javascript
AddOutputFilterByType DEFLATE application/xhtml+xml
AddOutputFilterByType DEFLATE application/xml
AddOutputFilterByType DEFLATE font/opentype
AddOutputFilterByType DEFLATE font/otf
AddOutputFilterByType DEFLATE font/woff
AddOutputFilterByType DEFLATE font/ttf
AddOutputFilterByType DEFLATE image/svg+xml
AddOutputFilterByType DEFLATE image/x-icon
AddOutputFilterByType DEFLATE image/vnd.microsoft.icon
AddOutputFilterByType DEFLATE text/css
AddOutputFilterByType DEFLATE text/html
AddOutputFilterByType DEFLATE text/javascript
AddOutputFilterByType DEFLATE text/plain
AddOutputFilterByType DEFLATE text/xml
# Force compression for mangled `Accept-Encoding` request headers
# https://developer.yahoo.com/blogs/ydn/pushing-beyond-gzipping-25601.html
<IfModule mod_setenvif.c>
<IfModule mod_headers.c>
SetEnvIfNoCase ^(Accept-EncodXng|X-cept-Encoding|X{15}|~{15}|-{15})$ ^((gzip|deflate)\s*,?\s*)+|[X~-]{4,13}$ HAVE_Accept-Encoding
RequestHeader append Accept-Encoding "gzip,deflate" env=HAVE_Accept-Encoding
</IfModule>
</IfModule>
</IfModule>
# BEGIN WordPress
<IfModule mod_rewrite.c>
RewriteEngine On
RewriteBase /
RewriteRule ^index\.php$ - [L]
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule . /index.php [L]
</IfModule>
# END WordPress
## EXPIRES CACHING ##
<IfModule mod_expires.c>
ExpiresActive On
ExpiresDefault "access plus 2 days"
ExpiresByType image/jpg "access plus 1 month"
ExpiresByType image/jpeg "access plus 1 month"
ExpiresByType image/gif "access plus 1 month"
ExpiresByType image/png "access plus 1 month"
ExpiresByType image/webp "access plus 1 month"
ExpiresByType image/svg+xml "access plus 1 month"
ExpiresByType text/css "access plus 1 month"
ExpiresByType application/pdf "access plus 1 month"
ExpiresByType text/javascript "access plus 1 month"
ExpiresByType text/x-javascript "access plus 1 month"
ExpiresByType application/x-shockwave-flash "access plus 1 month"
ExpiresByType image/x-icon "access plus 1 month"
ExpiresByType image/icon "access plus 1 month"
ExpiresByType image/vnd.microsoft.icon "access plus 1 month"
ExpiresByType application/x-icon "access plus 1 month"
ExpiresByType application/icon "access plus 1 month"
# Web fonts
# Embedded OpenType (EOT)
ExpiresByType application/vnd.ms-fontobject "access plus 1 month"
ExpiresByType font/eot "access plus 1 month"
# OpenType
ExpiresByType font/opentype "access plus 1 month"
# TrueType
ExpiresByType application/x-font-ttf "access plus 1 month"
# Web Open Font Format (WOFF) 1.0
ExpiresByType application/font-woff "access plus 1 month"
ExpiresByType application/x-font-woff "access plus 1 month"
ExpiresByType font/woff "access plus 1 month"
# Web Open Font Format (WOFF) 2.0
ExpiresByType application/font-woff2 "access plus 1 month"
# Other
ExpiresByType text/x-cross-domain-policy "access plus 1 week"
</IfModule>
## EXPIRES CACHING ##
<IfModule mod_headers.c>
Header always set Strict-Transport-Security "max-age=16070400; includeSubDomains"
Header set X-Content-Type-Options "nosniff"
Header set X-XSS-Protection "1; mode=block"
<FilesMatch "\.(appcache|atom|bbaw|bmp|crx|css|cur|eot|f4[abpv]|flv|geojson|gif|htc|ico|jpe?g|js|json(ld)?|m4[av]|manifest|map|mp4|oex|og[agv]|opus|otf|pdf|png|rdf|rss|safariextz|svgz?|swf|topojson|tt[cf]|txt|vcard|vcf|vtt|webapp|web[mp]|webmanifest|woff2?|xloc|xml|xpi)$">
Header unset X-XSS-Protection
</FilesMatch>
</IfModule>
ServerSignature Off
.htaccess
Protection For POST Abuse (WordPress)
A good strategy is to use .htaccess
to restrict POST
requests to commonly-abused URLs by referrer. Basically, if the referrer isnât your own website, you just discard the request with an error.
Try this, in conjunction with hiding the login address:
<IfModule mod_rewrite.c>
RewriteEngine On
RewriteCond %{REQUEST_METHOD} POST
RewriteCond %{REQUEST_URI} .(wp-comments-post|wp-login)\.php*
RewriteCond %{HTTP_REFERER} !.*example.com.* [OR]
RewriteCond %{HTTP_USER_AGENT} ^$
RewriteRule (.*) http://%{REMOTE_ADDR}/$1 [R=301,L]
</IfModule>
Otherwise, the website (CMS) has to fire up to deal with the request, even if itâs a nonsense request. So itâs a redirect, back to the same page, to check for a real browser instead of a bot. Itâs a redirect to their IP, which will almost certainly kick out an error message. You basically want to redirect the traffic anywhere but your server.