Change robots.txt to exclude some URLs (#10037)

- Exclude static assets
- Exclude uploaded files
- Exclude alternate versions of the profile page
- Exclude media proxy URLs
This commit is contained in:
Eugen Rochko 2019-02-13 21:28:18 +01:00 committed by GitHub
parent 7750416597
commit 80161f4351
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 13 additions and 5 deletions

View File

@ -1,5 +1,13 @@
# See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
#
# To ban all spiders from the entire site uncomment the next two lines:
# User-agent: *
# Disallow: /
User-Agent: *
Disallow: /users/*/followers
Disallow: /users/*/following
Disallow: /@*/media
Disallow: /@*/with_replies
Disallow: /@*/tagged/*
Disallow: /media_proxy/*
Disallow: /emoji/*
Disallow: /packs/*
Disallow: /sounds/*
Disallow: /system/*
Disallow: /avatars/*
Disallow: /headers/*