Skip to content

Commit

Permalink
robots.txt fun
Browse files Browse the repository at this point in the history
>:)
  • Loading branch information
DynTylluan committed Jul 29, 2024
1 parent c9c1036 commit 1548699
Show file tree
Hide file tree
Showing 4 changed files with 53 additions and 1 deletion.
6 changes: 5 additions & 1 deletion robots.txt
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,9 @@
# Page is under FOPL-ZERO
# https://owly.fans/license/fopl-zero

# Want to know when this page is updated? Follow the RSS!
# https://owly.fans/rss/robots.xml

# This page is also on git:
# Please feel free to suggest a change to this file
# https://github.com/DynTylluan/owly.fans/blob/main/robots.txt (main)
Expand Down Expand Up @@ -64,6 +67,7 @@ Disallow: /
# Used to «improve language models for our speech recognition technology»,
# so more AI rubbish that I don't want from a company that I don't like.
User-Agent: FacebookBot
User-Agent: meta-externalagent
Disallow: /

# Google's AdSense/StoreBot bots
Expand Down Expand Up @@ -150,7 +154,7 @@ Disallow: /
# it, simply remove the «#» before the «User-agent» and «Disallow» part.

# DuckDuckGo
# The search engine website uses the following bot to index sites.
# The search engine website uses the following bots to index sites.
# https://duckduckgo.com/duckduckbot
# https://duckduckgo.com/duckduckgo-help-pages/results/duckduckbot
# https://duckduckgo.com/duckduckgo-help-pages/results/sources
Expand Down
7 changes: 7 additions & 0 deletions rss/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -84,6 +84,13 @@ <h1>RSS Feeds</h1>
<p><a href="../doom/history/">Doom: Rediscovering History</a> is a blog all about Doom (1993) and its many, <em>many</em> mods made for it.</p>

<p>Follow every time a new issue is published: <a href="../doom/history/history.rss"><img src="rss.svg" title="RSS Feed icon." alt="RSS Feed icon." width="19" height="19"></a>



<p><a href="#robots.txt"><h3>robots.txt</h3></a><a name="robots.txt"></a></p>
<p>My <a href="../robots.txt"><tt>robots.txt</tt></a> is used on a few websites by a number of sysops, so as a way of letting people know when a change is made only to this file, this feed was made.</p>

<p>Follow every time a new version is published: <a href="robots.xml"><img src="rss.svg" title="RSS Feed icon." alt="RSS Feed icon." width="19" height="19"></a>

<p></p>
<hr>
Expand Down
Binary file added rss/robot.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
41 changes: 41 additions & 0 deletions rss/robots.xml
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<channel>
<title>robots.txt updates</title>
<link>https://owly.fans/rss/robots.xml</link>
<description>See when OwlyFans updates their robots.txt</description>
<language>en-us</language>
<image>
<url>https://owly.fans/rss/robot.png</url>
</image>
<!--
Image credit: Robot by Adrien Coquet
https://commons.wikimedia.org/wiki/File:Noun_Robot_1749584.svg
Edit a little by Cass so there is a blue shadow.
-->

<item>
<title>2024-07-29: This feed is set up</title>
<pubDate>Mon, 29 Jul 2024</pubDate>
<description>
<![CDATA[
This feed is set up as a way for website owners to know when I update my robots.txt as I know that there are people who use what I made on their own site.
<p>
The first update comes thanks to a post by Seirdy, who writes that ®Facebook/Meta updated its robots.txt entry for opting out of GenAI data scraping. If you blocked FacebookBot before, you should block meta-externalagent now [as the bot was renamed]?.
</p>
<p>
It is legitimately scummy that Facebook chose to do this, but regardless, I have decided to block both FacebookBot and meta-externalagent, even if it is technically incorrect to block the former.
</p>
<p>
Thank you to Piper of <a href="https://yarrie.net">yarrie.net</a> for showing me this originally.
</p>
<p>
The Seirdy post: <a href="https://pleroma.envs.net/notice/AkLKKvKad7mzVYN8bY">https://pleroma.envs.net/notice/AkLKKvKad7mzVYN8bY</a>
</p>
]]>
</description>
</item>

</channel>
</rss>

0 comments on commit 1548699

Please sign in to comment.