Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
leogao2 authored Jun 11, 2021
1 parent faa4400 commit fae4694
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# OpenWebText2

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.
This project is part of EleutherAI's quest to create a massive repository of high quality text data for training language models.

Very briefly, OpenWebText2 is a large filtered dataset of text documents scraped from URL found on Reddit submisisons.

Expand All @@ -18,4 +18,4 @@ For further information please visit our [documentation](https://openwebtext2.re
[leogao2](https://github.com/leogao2/) provided overall design guidance, lm_dataformat, and performed another chunk of scraping. <br />
[Colaboratory](https://colab.research.google.com/) VMs helped us with about 10% of our overall scraping. <br />
[The Eye](http://the-eye.eu/) host our processed datasets.<br />
[Read The Docs](https://readthedocs.org/) host our documentation.<br />
[Read The Docs](https://readthedocs.org/) host our documentation.<br />

0 comments on commit fae4694

Please sign in to comment.