Skip to content

Commit

Permalink
Bump release (#668)
Browse files Browse the repository at this point in the history
* translate no.24

* review 06 cn translations

* review 07 cn translations

* Update 23_what-is-dynamic-padding.srt

* Update 23_what-is-dynamic-padding.srt

* Update 23_what-is-dynamic-padding.srt

* Update subtitles/zh-CN/23_what-is-dynamic-padding.srt

Co-authored-by: Luke Cheng <[email protected]>

* Update subtitles/zh-CN/23_what-is-dynamic-padding.srt

Co-authored-by: Luke Cheng <[email protected]>

* add blank

* Review No. 11, No. 12

* Review No. 13

* Review No. 12

* Review No. 14

* finished review

* optimized translation

* optimized translation

* docs(zh-cn): Reviewed No. 29 - Write your training loop in PyTorch

* Review 15

* Review 16

* Review 17

* Review 18

* Review ch 72 translation

* Update 72 cn translation

* To be reviewed No.42-No.54

* No.11 check-out

* No.12 check-out

* No. 13 14 check-out

* No. 15 16 check-out

* No. 17 18 check-out

* Add note for "token-*"

* Reviewed No.8, 9, 10

* Reviewed No.42

* Review No.43

* finished review

* optimized translation

* finished review

* optimized translation

* Review 44(need refine)

* Review 45(need refine)

* Review No. 46 (need refine)

* Review No.47

* Review No.46

* Review No.45

* Review No.44

* Review No.48

* Review No.49

* Review No.50

* Modify Ko chapter2 8.mdx (#465)

* Add Ko chapter2 2.mdx

* Add Ko chapter2 2.mdx

* Add Ko chapter2 3.mdx & 4.mdx

* Modify Ko chapter2 3.mdx & 4.mdx

* Modify Ko chapter2 3.mdx & 4.mdx

* Modify Ko chapter2 3.mdx & 4.mdx

* Modify _toctree.yml

* Add Ko chapter2 5.mdx

* Modify Ko chapter2 4.mdx

* Add doc-builder step

* Add Ko chapter2 6~8.mdx & Modify Ko chapter2 2.mdx typo

* Modify Ko _toctree.yml

* Modify Ko chapter2 8.mdx & README.md

* Fixed typo (#471)

* fixed subtitle errors (#474)

timestamp: 00:00:26,640 --> 00:00:28,620
modification: notification --> authentication

timestamp: 00:04:21,113 --> 00:04:22,923
modification: of --> or

* Fixed a typo (#475)

* Update 3.mdx (#526)

Fix typo

* [zh-TW] Added chapters 1-9 (#477)

The translation is based on Simplified Chinese version, converted via OpenCC and fixed some formatting issues.

* finished review

* Explain why there are more tokens, than reviews (#476)

* Explain why there are more tokens, than reviews

* Update chapters/en/chapter5/3.mdx

---------

Co-authored-by: lewtun <[email protected]>

* [RU] Subtitles for Chapter 1 of the video course (#489)

* Created a directory for the russian subtitles.

Created a folder for Russian subtitles for the video course and published a translation of the introductory video from chapter 1.

* Uploaded subtitles for chapter 1

Uploaded subtitles for the remaining videos for chapter 1 of the video course.

* Added subtitles for chapter 2 of the video course

Added STR subtitle files for the second chapter of the YouTube video course.

* Delete subtitles/ru directory

Removed the old translation. Incorrect timestamping.

* Create 00_welcome-to-the-hugging-face-course.srt

Create a directory and upload a subtitle file for the introductory video of the course.

* Add files via upload

Upload subtitle files for the first chapter of the course.

* Review No.52

* [ru] Added the glossary and translation guide (#490)

* Added the glossary and translation guide

* Fixed casing

* Minor fixes

* Updated glossary

* Glossary update

* Glossary update

* Glossary update

* [ru] Chapters 0 and 1 proofreading, updating and translating missing sections (#491)

* Chapter 0 proofreading

* Chapter 1 Section 1 proofreading
- Added new people from English version;
- Added links to creator's pages;
- Added FAQ translation;

* Chapter 1 Sections 2-5 proofreading

* Chapter 1 Sections 6-9 proofreading

* Final proofreading and added missing quiz section

* Minor spelling corrections

* Review No.51

* Review No.53

* Review No.54

* finished review

* modified translation

* modified translation

* modified subtitle

use the same text appeared in video

* translated

* Fix typo (#532)

* review chapter4/2

* review chapter4/2

* review chapter4/2

* Review 75

* Review No.20, need review some

* docs(zh-cn): Reviewed Chapter 7/1

* Update 1.mdx

* Review No.22

* Review No.21 (need refinement)

* Review No.30, need review: 26 27 28 30 73 74

* Review 30 (good)

* Review 20

* Review 21 (refine)

* Review 21

* Review 22

* Review 26

* Review 27

* Review 28

* Review 30

* Review 73

* Review 74

* Fix typo

* Review 26-28, 42-54, 73-75

* The GPT2 link is broken

The link `/course/en/chapter7/section6` does not exist in the course.  Corrected to `/course/en/chapter7/6`.

* typo in `Now your turn!` section

Duplicated `the` was removed

* `chunk_size` should be instead of `block_size` 

`chunk_size` should be instead of `block_size` (`block_size` was never mentioned before)

* refactor: rephrase text to improve clarity and specificity

In context to "training with a dataset specific to your task" and "train directly for the final task", I was not able to infer easily that "directly" here implies training from scratch.

* Demo link fixes (#562)

* demo link fixes

* minor demo fix

* Bump release (#566)

* Add note about `remove_unused_columns` for whole word masking

* Merge pull request #24 from huggingface/fix-typo

Fix typo

* Merge pull request #26 from huggingface/fix-qa-offsets

Fix inequalities in answer spans for QA chapter

* Merge pull request #30 from huggingface/fix-modelcard-url

Update model card URL

* Merge pull request #69 from yulonglin/patch-1

Correct typo mixing up space and newline symbols

* Bump release (#99)

* Bump release
* Update author list

Co-authored-by: DOOHAE JUNG <[email protected]>
Co-authored-by: m_khandaker <[email protected]>
Co-authored-by: Md. Al-Amin Khandaker <[email protected]>
Co-authored-by: ftarlaci <[email protected]>
Co-authored-by: Doohae Jung <[email protected]>
Co-authored-by: melaniedrevet <[email protected]>

* Bump release  (#115)

* ko-chapter1/1

* ko _toctree.yml created

* Fix the issue #80

* Single expression changed

* ko/chapter1 finished

* ko/chapter0 finished

* ko/chapter0 finished

* reviewed by @bzantium ko/chapter0

* reviewed by @bzantium chapter0 & fixed typo

* reviewed by @rainmaker712

* maximize Korean expressions

* [Chapter 1] bangla traslation initial commit

* Update 1.mdx

update and fix formating

* Fix formating and typos

* translate _toctree.yml 0-1 chapter

* Add Korean to CI

* [tr] Translated chapter1/2.mdx

* remove translation from sec titles not yet translated

* Add authors [th ru]

* [FIX] _toctree.yml

* Update chapters/bn/chapter0/1.mdx

[FIX] syntax formatting

Co-authored-by: lewtun <[email protected]>

* tag typos & indentation & unnatural expressions

* modified toctree.yml for chapter1/2

* modified toctree.yml for chapter1/2 & fix typo

* French Translation - Chapter 5

* Add Bengali to CI

* Update author list

* Adding translations for 2/4 and 2/5 🚀 (#74)

* Adding translations for 2/4 and 2/5 🚀

* Remove English content

Co-authored-by: lewtun <[email protected]>

* Translation to Russian (#97)

* translation of chapter 2/section 1

* add section 1 / chapter 2 to _toctree.yml

* Translation of Chapter0 to Hindi (#86)

* Hindi?Chapter0-Part_1

* Hindi/Chapter0-Part_2

* Chapter 0 Persian Translation First Draft (#95)

* merged branch0 into main. no toctree yet.

* updated toctree.

* Updated the glossary with terms from chapter0.

* Second draft in collab w/ @schoobani. Added empty chapter1 for preview.

* Glossary typo fix.

* Translation of Chapter0 (setup) to Arabic (#104)

* Add AR translation for `introduction`

* Fix alignment & format

* Add Arabic to CI build

* Russian - Chapter 1 finished (#98)

* 01/4 start

* 1/4 finished

* 1/5 finished

* 1/5 update toc

* 1/6 finished

* 1/7 finished

* 1/8 finished

* 1/9 finished

* 1/4 fix

* toc update

* Chinese - Chapter 1 finished (#113)

* Chinese - Chapter 1 finished

* Add zh to the languages field

Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>

* [PT] Translation of chapter 2 (#107)

* add PT translate to 2.1

* add PT translate to 2.2

* add portuguese translation to 2.3

* WIP portuguese translation to 2.4

* add portuguese translation to 2.4

* add portuguese translation to 2.5

* add portuguese translation to 2.6

* add _toctree infos

Co-authored-by: lewtun <[email protected]>

* [FR] Translation of chapter 2 & event + Review of chapters 0 & 5 (#106)

* Update _toctree.yml

Add chapter 2 
+ little fix of chapter 5

* Update 1.mdx

Review of chapter 0

* Create 1.mdx

* Create 2.mdx

* Create 3.mdx

* Create 4.mdx

* Create 5.mdx

* Create 6.mdx

* Create 7.mdx

* Create 8.mdx

* Update 8.mdx

Since AutoNLP has recently been renamed to AutoTrain, let me make the correction on the English file

* Update 1.mdx

Review of chapter 5/1

* Update 2.mdx

Review of chapter 5/2

* Update 3.mdx

Review of chapter 5/3

* Update 4.mdx

Review of chapter 5/4

* Update 5.mdx

Review of chapter 5/5

* Update 6.mdx

Review of chapter 5/6

* Update 7.mdx

Review of chapter 5/7

* Update 8.mdx

Review of chapter 5/8

* Create 1.mdx

event's translation

* Update _toctree.yml

add event to the tree

* Update _toctree.yml

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* Delete 1.mdx

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* make style correction

* Update _toctree.yml

the -

* Fix spacing

Co-authored-by: Lewis Tunstall <[email protected]>

* [th] Translated Chapter2/1 (#83)

* Finish chapter2/1

* Update _toctree.yml

* ko-chapter1/1

* ko _toctree.yml created

* Fix the issue #80

* Single expression changed

* ko/chapter1 finished

* ko/chapter0 finished

* ko/chapter0 finished

* reviewed by @bzantium ko/chapter0

* reviewed by @bzantium chapter0 & fixed typo

* reviewed by @rainmaker712

* maximize Korean expressions

* [Chapter 1] bangla traslation initial commit

* Update 1.mdx

update and fix formating

* Fix formating and typos

* translate _toctree.yml 0-1 chapter

* Add Korean to CI

* remove translation from sec titles not yet translated

* Add authors [th ru]

* [FIX] _toctree.yml

* Update chapters/bn/chapter0/1.mdx

[FIX] syntax formatting

Co-authored-by: lewtun <[email protected]>

* tag typos & indentation & unnatural expressions

* modified toctree.yml for chapter1/2

* modified toctree.yml for chapter1/2 & fix typo

* Add Bengali to CI

* Update author list

* Adding translations for 2/4 and 2/5 🚀 (#74)

* Adding translations for 2/4 and 2/5 🚀

* Remove English content

Co-authored-by: lewtun <[email protected]>

* Translation to Russian (#97)

* translation of chapter 2/section 1

* add section 1 / chapter 2 to _toctree.yml

* Translation of Chapter0 to Hindi (#86)

* Hindi?Chapter0-Part_1

* Hindi/Chapter0-Part_2

* Chapter 0 Persian Translation First Draft (#95)

* merged branch0 into main. no toctree yet.

* updated toctree.

* Updated the glossary with terms from chapter0.

* Second draft in collab w/ @schoobani. Added empty chapter1 for preview.

* Glossary typo fix.

* Translation of Chapter0 (setup) to Arabic (#104)

* Add AR translation for `introduction`

* Fix alignment & format

* Add Arabic to CI build

* Russian - Chapter 1 finished (#98)

* 01/4 start

* 1/4 finished

* 1/5 finished

* 1/5 update toc

* 1/6 finished

* 1/7 finished

* 1/8 finished

* 1/9 finished

* 1/4 fix

* toc update

* Chinese - Chapter 1 finished (#113)

* Chinese - Chapter 1 finished

* Add zh to the languages field

Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>

* [PT] Translation of chapter 2 (#107)

* add PT translate to 2.1

* add PT translate to 2.2

* add portuguese translation to 2.3

* WIP portuguese translation to 2.4

* add portuguese translation to 2.4

* add portuguese translation to 2.5

* add portuguese translation to 2.6

* add _toctree infos

Co-authored-by: lewtun <[email protected]>

* [FR] Translation of chapter 2 & event + Review of chapters 0 & 5 (#106)

* Update _toctree.yml

Add chapter 2 
+ little fix of chapter 5

* Update 1.mdx

Review of chapter 0

* Create 1.mdx

* Create 2.mdx

* Create 3.mdx

* Create 4.mdx

* Create 5.mdx

* Create 6.mdx

* Create 7.mdx

* Create 8.mdx

* Update 8.mdx

Since AutoNLP has recently been renamed to AutoTrain, let me make the correction on the English file

* Update 1.mdx

Review of chapter 5/1

* Update 2.mdx

Review of chapter 5/2

* Update 3.mdx

Review of chapter 5/3

* Update 4.mdx

Review of chapter 5/4

* Update 5.mdx

Review of chapter 5/5

* Update 6.mdx

Review of chapter 5/6

* Update 7.mdx

Review of chapter 5/7

* Update 8.mdx

Review of chapter 5/8

* Create 1.mdx

event's translation

* Update _toctree.yml

add event to the tree

* Update _toctree.yml

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* Delete 1.mdx

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* make style correction

* Update _toctree.yml

the -

* Fix spacing

Co-authored-by: Lewis Tunstall <[email protected]>

* [th] Translated Chapter2/1 (#83)

* Finish chapter2/1

* Update _toctree.yml

* Add Hindi to CI (#116)

Co-authored-by: DOOHAE JUNG <[email protected]>
Co-authored-by: m_khandaker <[email protected]>
Co-authored-by: Md. Al-Amin Khandaker <[email protected]>
Co-authored-by: ftarlaci <[email protected]>
Co-authored-by: Doohae Jung <[email protected]>
Co-authored-by: melaniedrevet <[email protected]>
Co-authored-by: Jose M Munoz <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Giyaseddin Bayrak <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>

* Bump release 4 (#133)

* Bump release (#138)

* ko-chapter1/1

* ko _toctree.yml created

* Fix the issue #80

* Single expression changed

* ko/chapter1 finished

* ko/chapter0 finished

* ko/chapter0 finished

* reviewed by @bzantium ko/chapter0

* reviewed by @bzantium chapter0 & fixed typo

* reviewed by @rainmaker712

* maximize Korean expressions

* [Chapter 1] bangla traslation initial commit

* Update 1.mdx

update and fix formating

* Fix formating and typos

* translate _toctree.yml 0-1 chapter

* Add Korean to CI

* [tr] Translated chapter1/2.mdx

* remove translation from sec titles not yet translated

* Add authors [th ru]

* [FIX] _toctree.yml

* Update chapters/bn/chapter0/1.mdx

[FIX] syntax formatting

Co-authored-by: lewtun <[email protected]>

* tag typos & indentation & unnatural expressions

* modified toctree.yml for chapter1/2

* modified toctree.yml for chapter1/2 & fix typo

* French Translation - Chapter 5

* Add Bengali to CI

* Update author list

* Adding translations for 2/4 and 2/5 🚀 (#74)

* Adding translations for 2/4 and 2/5 🚀

* Remove English content

Co-authored-by: lewtun <[email protected]>

* Translation to Russian (#97)

* translation of chapter 2/section 1

* add section 1 / chapter 2 to _toctree.yml

* Translation of Chapter0 to Hindi (#86)

* Hindi?Chapter0-Part_1

* Hindi/Chapter0-Part_2

* Chapter 0 Persian Translation First Draft (#95)

* merged branch0 into main. no toctree yet.

* updated toctree.

* Updated the glossary with terms from chapter0.

* Second draft in collab w/ @schoobani. Added empty chapter1 for preview.

* Glossary typo fix.

* Translation of Chapter0 (setup) to Arabic (#104)

* Add AR translation for `introduction`

* Fix alignment & format

* Add Arabic to CI build

* Russian - Chapter 1 finished (#98)

* 01/4 start

* 1/4 finished

* 1/5 finished

* 1/5 update toc

* 1/6 finished

* 1/7 finished

* 1/8 finished

* 1/9 finished

* 1/4 fix

* toc update

* Chinese - Chapter 1 finished (#113)

* Chinese - Chapter 1 finished

* Add zh to the languages field

Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>

* [PT] Translation of chapter 2 (#107)

* add PT translate to 2.1

* add PT translate to 2.2

* add portuguese translation to 2.3

* WIP portuguese translation to 2.4

* add portuguese translation to 2.4

* add portuguese translation to 2.5

* add portuguese translation to 2.6

* add _toctree infos

Co-authored-by: lewtun <[email protected]>

* [FR] Translation of chapter 2 & event + Review of chapters 0 & 5 (#106)

* Update _toctree.yml

Add chapter 2 
+ little fix of chapter 5

* Update 1.mdx

Review of chapter 0

* Create 1.mdx

* Create 2.mdx

* Create 3.mdx

* Create 4.mdx

* Create 5.mdx

* Create 6.mdx

* Create 7.mdx

* Create 8.mdx

* Update 8.mdx

Since AutoNLP has recently been renamed to AutoTrain, let me make the correction on the English file

* Update 1.mdx

Review of chapter 5/1

* Update 2.mdx

Review of chapter 5/2

* Update 3.mdx

Review of chapter 5/3

* Update 4.mdx

Review of chapter 5/4

* Update 5.mdx

Review of chapter 5/5

* Update 6.mdx

Review of chapter 5/6

* Update 7.mdx

Review of chapter 5/7

* Update 8.mdx

Review of chapter 5/8

* Create 1.mdx

event's translation

* Update _toctree.yml

add event to the tree

* Update _toctree.yml

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* Delete 1.mdx

deletion of the files that pose a problem to pass the checks, will be resubmitted in another PR

* make style correction

* Update _toctree.yml

the -

* Fix spacing

Co-authored-by: Lewis Tunstall <[email protected]>

* [th] Translated Chapter2/1 (#83)

* Finish chapter2/1

* Update _toctree.yml

* Add Hindi to CI (#116)

* Update README.md (#87)

* Update authors on README (#120)

* Update authors

* French translation `Chapter1` full (#56)


* traduction 1st part of chapter1

* fix typo

* fix job titles and encoder-decoder translation

* add part 2 for 1st chapter

* fix some typo part2

* fix Transformer -> Transformers

* add part 3 not totally ended

* end of part3 of chapter1

* part9 chapter 1

* add part7 chapter 1

* add part5 chapter 1

* part 6 chapter 1

* add part8 chapter 1

* end quizz of chapter

* add last part of chapter 1

Co-authored-by: ChainYo <[email protected]>

* Translate to Japanese Chapter0 (#123)

* start working

* translate chapter0/1.mdx

* [FA] First draft of Chapter2/Page2 (#129)

* merged branch0 into main. no toctree yet.

* updated toctree.

* Updated the glossary with terms from chapter0.

* Second draft in collab w/ @schoobani. Added empty chapter1 for preview.

* Glossary typo fix.

* Added missing backticks.

* Removed a couple of bad indefinite articles I forgot.

* First draft of ch2/p2. Adds to glossary. Trans. guidelines moved out.

* Fixed missing diacritics, fixed the py/tf switch placing. Other fixes.

* Changed the equivalent for prediction per @kambizG 's direction.

* Redid ambiguous passage in translation per @lewtun 's direction.

* [th] Finished whole Chapter 2 translation (#127)

* Finish chapter2/1

* delete untranslated files

* chapter2/2 WIP

* Delete WIP files

* WIP chapter2/2

* Fixed conflict

* Update _toctree.yml

* Update _toctree.yml

* Finished Chapter2/2

* Finish all chapter2/n

* Finish all chapter2/n

* Fixed Ch2/8 as PR run failed

* [de] Translation Chapter 0 (#130)

* Copy files to newly created german dir (de)

* Add translation for chapter 0

* Clean up english files for chapter 1

* Change _toctree.yml for chapter 0

* Fix whitespaces

* Fix whitespaces again

* Adjust _toctree.yml - leave only chaper 0

* Add German language (de) to workflow yaml files

* [de] German Translation Guide (#132)

* German Translation Guide

* Add German Glossary to TOC

* Chapter 1, Section 1 Bengali translation (#124)

* [ADD] Chapter 1, Section 1 benglai tranlation

* [FIX] toc

* [FIX] commit mistakes

* [FIX] remove the Eng duplicates

Co-authored-by: m_khandaker <[email protected]>

* [FR] Review of chapters 0, 2 & 5 + add chapters 6, 7, 8 & event (#125)

* Create 1.mdx

Event translation

* Create 1.mdx

* Chapter 6 in French

* Update 1.mdx

fix italic

* Update 9.mdx

fix italic

* Update 3.mdx

fix italic

* Update 4.mdx

fix italic

* Update 4.mdx

* Update 1.mdx

little fix

* Update 2.mdx

little fix

* Update 4.mdx

fix italic

* Update 8.mdx

fix italic

* Update 1.mdx

little fix

* Update 2.mdx

little fix

* Update 3.mdx

little fix

* Update 5.mdx

little fix

* Update 7.mdx

little fix

* Update 8.mdx

little fix

* add chapter8

* Update 6.mdx

fix italic

* Update 3.mdx

fix, fix everywhere

* Update 2.mdx

fix, fix everywhere

* Update 4.mdx

fix, fix everywhere

* Update 4_tf.mdx

fix, fix everywhere

* Add files via upload

add chapter 7

* Update 1.mdx

fix links

* Update 2.mdx

fix, fix everywhere

* Update 3.mdx

fix, fix everywhere

* Update 4.mdx

fix, fix everywhere

* Update 5.mdx

* Update 6.mdx

fix, fix everywhere

* Update 7.mdx

fix, fix everywhere

* Update 3.mdx

fix link

* Update 8.mdx

fix link

* Update 2.mdx

fix link

* Update 4.mdx

little fix

* Update 6.mdx

* Update 7.mdx

* Update 8.mdx

fix

* Update 2.mdx

little fix

* Update 3.mdx

little fix

* Update 5.mdx

* Update 4_tf.mdx

little fix

* Update _toctree.yml

Forgot the toctree

* Update _toctree.yml

fix local fields

* Update _toctree.yml

My bad, I forgot some 🙃

* Update 7.mdx

I don't know why it was there...

* Update 1.mdx

* [de] Chapter 3 translation (#128)

* chapter 3 part 1 DE

* [DE] Chapter 3 - Part 2

* Prepare TOC-Tree

* Fein-tuning

* Initial translation

* Glossary additions for C3P3

* C3P2 style

* [de] Chapter 3 P3-TF initial translation

* [de] Chapter 3 P4 initial translation

* [de] Chapter 3 Part 5 initial translation

* [de] Chapter 3 P6 Initial translation

* Missing commas

* fixing quotes

* Mark third option on chapter 8, question 8 as correct (#135)

* doc_change(translation): translating course from english to gujarati (#126)

* change(translation): chapter0 to gujarati

content translated: Chapter0/1.mdx - Introduction

commit-by: [email protected]

* Revert "change(translation): chapter0 to gujarati"

This reverts commit c27e06992af8892687f343a19368ce322d69e8b2.

* doc_change(translation): translation to gj

translated content: chapters/gj/chapter0.mdx - introduction

* doc_change(translation): translation to gj

translated content: chapters/gj/chapter0.mdx - introduction

* Delete _toctree.yml

* change: adding gj to github workflow

* nit: fix heading

* Update authors (#136)

* [FA] First draft of Chapter4/Page1 (#134)

* added chapter4 title and it's first section

* added first draft of Chapter4/Page1

* minor fix

* updated the title according to convention

* applied some updates according to convention

* added footnotes, minor improvements

* applied tweaks according to review points

* the new draft of glossary according to PR #134

* fixed an inconsistant title

* minor fix for better compatibility with T points

* applied final touches for this round of G updates

* [FR] End of chapter 3 + chapter 4  (#137)

* add chapters 3 & 4

* Update 2.mdx

fix links

* Update 3.mdx

some fix

* Update 6.mdx

fix tag

* Update 3.mdx

add link to chapter 7

* Update 3_tf.mdx

add link to chapter 7

* Update _toctree.yml

Co-authored-by: DOOHAE JUNG <[email protected]>
Co-authored-by: m_khandaker <[email protected]>
Co-authored-by: Md. Al-Amin Khandaker <[email protected]>
Co-authored-by: ftarlaci <[email protected]>
Co-authored-by: Doohae Jung <[email protected]>
Co-authored-by: melaniedrevet <[email protected]>
Co-authored-by: Jose M Munoz <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Giyaseddin Bayrak <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Thomas Chaigneau <[email protected]>
Co-authored-by: ChainYo <[email protected]>
Co-authored-by: hiromu <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Marcus Fraaß <[email protected]>
Co-authored-by: Jesper Dramsch <[email protected]>
Co-authored-by: amyeroberts <[email protected]>
Co-authored-by: Ash <[email protected]>
Co-authored-by: Hamed Homaei Rad <[email protected]>

* Bump release (#147)

* Bump release (#161)

* Fix typos in chapter 9 (#176) (#180)

Co-authored-by: regisss <[email protected]>

* Bump release (#187)

* Chapter 2 Section 1 Bengali Translation (huggingface#72) (#168)

* [TH] Chapter 6 Section 1 and 2 (#171)

Co-authored-by: Suteera <[email protected]>

* [FA] CH1 / P1-2 (#142)

* Spanish Chapter 3: sections 1 & 2 (#162)

* fix typos in bpe, wordpiece, unigram (#166)

* [FR] French Review (#186)

* Part 7: Training a causal... fixes (#179)

* typo & error mitigation

* consistency

* Trainer.predict() returns 3 fields

* ran make style

* [TR] Translated Chapter 1.6 🤗 (#185)

* added chapter 1/6 to _toctree.yml

* [TR] Translated Chapter 1.6 🤗

Co-authored-by: Avishek Das <[email protected]>
Co-authored-by: Suteera  Seeha <[email protected]>
Co-authored-by: Suteera <[email protected]>
Co-authored-by: Saeed Choobani <[email protected]>
Co-authored-by: Fermin Ordaz <[email protected]>
Co-authored-by: Kerem Turgutlu <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Sebastian Sosa <[email protected]>
Co-authored-by: tanersekmen <[email protected]>

* Bump release 10 (#194)

* Bump release (#195)

* Bump release 12 (#209)

* Bump release (#224)

* Bump release (#229)

* Bump release (#236)

* Bump release (#258)

* Bump release (#270)

* Bump release (#274)

* Bump release (#286)

* Bump release (#288)

* Chapter 2 Section 1 Bengali Translation (huggingface#72) (#168)

* [TH] Chapter 6 Section 1 and 2 (#171)

Co-authored-by: Suteera <[email protected]>

* [FA] CH1 / P1-2 (#142)

* Spanish Chapter 3: sections 1 & 2 (#162)

* fix typos in bpe, wordpiece, unigram (#166)

* [FR] French Review (#186)

* Part 7: Training a causal... fixes (#179)

* typo & error mitigation

* consistency

* Trainer.predict() returns 3 fields

* ran make style

* [TR] Translated Chapter 1.6 🤗 (#185)

* added chapter 1/6 to _toctree.yml

* [TR] Translated Chapter 1.6 🤗

* [PT][Chapter 01 - 2.mdx] - issue #51 (#170)

* Fix Gradio ToC (#193)

* Add Gradio authors and Blocks event (#189)

* Update 6.mdx (#188)

Correct link to Transformer XL doc

* Add translating notes and glossary to Spanish (#192)

* Add translating notes and glosary to Spanish

* Adding glossary to the toc

* add pt 4.3 (#191)

* [FR] Visual corrections (#190)

* [PT] add chapter 4.4 and 4.5 (#196)

* fix invite discord link (#197)

* [FA] Second draft of CH2/P1-2 (#139)

* added chapter3 in hindi (#198)

* [TR] Chapter 3/1 (#165)

* [RU] Ch3-1/2/3 (#200)

* [PT] add 5.1 and 5.2 (#204)

* [FA] - Ch3 - P1 and P2 (#199)

* [PT] add `end-of-chapter quiz` for chapter 4 (4.6) (#201)


Co-authored-by: lewtun <[email protected]>

* Chapter1: 2.mdx Translated. (#206)

* Remove comments from Persian ToC (#210)

* Fix CI URL for PRs (#211)

* code fragment & english syntax and meaning (#203)

* Updated Ch1/1 with Emoji (#214)

* Add missing numpy import (#217)

* [ES] translate sections 8.1 and 8.2 (#215)

* Fix path to datasets (#216)

* [PT] add 5.3 (#218)

* fix 4.3 (#223)

* Fix notebook generation (#227)

* Add Gradio nb links

* add 5.4 (#226)

* add pt wip (#225)

* Added Gujarati List. (#221)

* Add Gradio nbs links to fr (#228)

* Chinese - Chapter 3finished (#219)

* add ch7 at _toctree and translate 7.1 (#222)

* add 5.5 (#235)

* [FR] Review of chapter 7 (#233)

* Italian translation - chapter 4 (#230)

* Added Thai translation of chapters 3 (#231)

* [Ru] Add part 2, chapter 2 (#234)

* Update 8.mdx (#237)

- Remove Gradio Blocks Party
- Add, Where to next? section

* Created HI/Chapter1/5.mdx (#232)

* Add Spanish chaper3/section4, update toc and glossary (#238)

* [RU] Chapter 3 finished (#239)

* [PT] add 5.6 and 5.7 (#240)

* [EN] Visual corrections (#245)

* Translation for 1/4, 1/5 and 1/6. (#247)

* add event in PT (#250)

* Pin version of black (#252)

* Translate ja event (#241)

* [PT] add quiz chapter 5 (#243)

* Update 5.mdx (#253)

inconsistent naming with line 327

* Translation for Traditional Chinese (zh-tw) chapter0  (#251)


Co-authored-by: Lewis Tunstall <[email protected]>

* Translated the whole Chapter 3 to Thai  (#255)

* Japanese chapter 4 (#244)

* Translation of 1/7, 1/8, and 1/9. (#263)

* [PT] add chapter  8.1 and 8.2 (#265)

* [RU] Chapter 4  (#269)

* Add Thai translation for chapter 6.3b to 6.10 (#268)

* add 8.3 (#266)

* 3.mdx of chapter 01 (#260)

Co-authored-by: Lewis Tunstall <[email protected]>

* Fix typo (#271)

* [PT] add chapter 6.1 (#273)

* add Japanese chapter7 (#267)

* replace `load_metric` with `evaluate.load` (#285)

* update `load_metric` refs to `evaluate.load`

Co-authored-by: lewtun <[email protected]>

* [GJ] Translation to Gujarati - Ch0 Setup (#287)

* [PT] add chapter 6.2 and 6.3 (#279)

* zh-CN - Chapter 4,5finished (#281)

Co-authored-by: Lewis Tunstall <[email protected]>

* Chapter 01 - Done [PT] #51 (#280)

Co-authored-by: Lewis Tunstall <[email protected]>

Co-authored-by: Avishek Das <[email protected]>
Co-authored-by: Suteera  Seeha <[email protected]>
Co-authored-by: Suteera <[email protected]>
Co-authored-by: Saeed Choobani <[email protected]>
Co-authored-by: Fermin Ordaz <[email protected]>
Co-authored-by: Kerem Turgutlu <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Sebastian Sosa <[email protected]>
Co-authored-by: tanersekmen <[email protected]>
Co-authored-by: Victor Costa <[email protected]>
Co-authored-by: Camille Couturier <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Kavya <[email protected]>
Co-authored-by: Batuhan Ayhan <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: Kambiz Ghoorchian <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Diego Vargas <[email protected]>
Co-authored-by: Thomas O'Brien <[email protected]>
Co-authored-by: Lincoln V Schreiber <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: Giorgio Severi <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Ömer Faruk Özdemir <[email protected]>
Co-authored-by: Caterina Bonan <[email protected]>
Co-authored-by: Hiromu Hota <[email protected]>
Co-authored-by: trtd56 <[email protected]>
Co-authored-by: Mehrdad Nezamdoost <[email protected]>
Co-authored-by: Wolvz <[email protected]>
Co-authored-by: a-krirk <[email protected]>
Co-authored-by: atgctg <[email protected]>
Co-authored-by: Thiago Medeiros <[email protected]>
Co-authored-by: webbigdata-jp <[email protected]>
Co-authored-by: Leandro von Werra <[email protected]>
Co-authored-by: Bhadresh Savani <[email protected]>

* Bump release (#295)

* Bump release (#296)

* Bump release (#299)

* Bump release (#305)

* Chinese - Chapter 1 finished

* Add zh to the languages field

 Add zh to the languages field in the build_documentation.yml and build_pr_documentation.yml files

* Remove untranslated chapters in _toctree.yml

Remove all these sections that haven't been translated yet
Remove Chapter 0 from the table of contents since it hasn't been translated yet

* Fixed an error in the translation format

Fixed an error in the translation format of Chapter 1, Section 3

* Added a small part of the missing content

* Fix style

* Complete the translation of Chapters 0 and 2

* Fixed some bugs

·Fixed some formatting errors
·Moved Chapters 0 and 2 to Simplified Chinese

* Add files via upload

Formatting revisions and some translation corrections

* run make style to format chapter1 session3

* run make style to format code

* run make style to format code

* Fix style

* Chapter 2 Section 1 Bengali Translation (huggingface#72) (#168)

* [TH] Chapter 6 Section 1 and 2 (#171)

Co-authored-by: Suteera <[email protected]>

* [FA] CH1 / P1-2 (#142)

* Spanish Chapter 3: sections 1 & 2 (#162)

* fix typos in bpe, wordpiece, unigram (#166)

* [FR] French Review (#186)

* Part 7: Training a causal... fixes (#179)

* typo & error mitigation

* consistency

* Trainer.predict() returns 3 fields

* ran make style

* [TR] Translated Chapter 1.6 🤗 (#185)

* added chapter 1/6 to _toctree.yml

* [TR] Translated Chapter 1.6 🤗

* [PT][Chapter 01 - 2.mdx] - issue #51 (#170)

* Fix Gradio ToC (#193)

* Add Gradio authors and Blocks event (#189)

* Update 6.mdx (#188)

Correct link to Transformer XL doc

* Add translating notes and glossary to Spanish (#192)

* Add translating notes and glosary to Spanish

* Adding glossary to the toc

* add pt 4.3 (#191)

* [FR] Visual corrections (#190)

* [PT] add chapter 4.4 and 4.5 (#196)

* fix invite discord link (#197)

* [FA] Second draft of CH2/P1-2 (#139)

* added chapter3 in hindi (#198)

* [TR] Chapter 3/1 (#165)

* [RU] Ch3-1/2/3 (#200)

* [PT] add 5.1 and 5.2 (#204)

* Add placeholders for audio chapters (#208)

* [FA] - Ch3 - P1 and P2 (#199)

* [PT] add `end-of-chapter quiz` for chapter 4 (4.6) (#201)


Co-authored-by: lewtun <[email protected]>

* Chapter1: 2.mdx Translated. (#206)

* Remove comments from Persian ToC (#210)

* Fix CI URL for PRs (#211)

* code fragment & english syntax and meaning (#203)

* Updated Ch1/1 with Emoji (#214)

* Add missing numpy import (#217)

* Updata chapter3

* Code format for chapter3

* Updata yml file of chapter3

* Uptata yml file of chapter3

* Fix yml file bug

* [ES] translate sections 8.1 and 8.2 (#215)

* Fix path to datasets (#216)

* [PT] add 5.3 (#218)

* fix 4.3 (#223)

* Run make style

* Fix notebook generation (#227)

* Add Gradio nb links

* add 5.4 (#226)

* add pt wip (#225)

* Added Gujarati List. (#221)

* Fix quality

* Add Gradio nbs links to fr (#228)

* Fix ToC tree

* Remove audio templates

* Fix fr section

* Fix fr chapter

* Chinese - Chapter 3finished (#219)

* add ch7 at _toctree and translate 7.1 (#222)

* add 5.5 (#235)

* [FR] Review of chapter 7 (#233)

* Italian translation - chapter 4 (#230)

* Added Thai translation of chapters 3 (#231)

* [Ru] Add part 2, chapter 2 (#234)

* Update 8.mdx (#237)

- Remove Gradio Blocks Party
- Add, Where to next? section

* Created HI/Chapter1/5.mdx (#232)

* Add Spanish chaper3/section4, update toc and glossary (#238)

* [RU] Chapter 3 finished (#239)

* [PT] add 5.6 and 5.7 (#240)

* [EN] Visual corrections (#245)

* Translation for 1/4, 1/5 and 1/6. (#247)

* add event in PT (#250)

* Pin version of black (#252)

* Translate ja event (#241)

* [PT] add quiz chapter 5 (#243)

* Update 5.mdx (#253)

inconsistent naming with line 327

* Translation for Traditional Chinese (zh-tw) chapter0  (#251)


Co-authored-by: Lewis Tunstall <[email protected]>

* Translated the whole Chapter 3 to Thai  (#255)

* Japanese chapter 4 (#244)

* Translation of 1/7, 1/8, and 1/9. (#263)

* [PT] add chapter  8.1 and 8.2 (#265)

* [RU] Chapter 4  (#269)

* Add Thai translation for chapter 6.3b to 6.10 (#268)

* add 8.3 (#266)

* 3.mdx of chapter 01 (#260)

Co-authored-by: Lewis Tunstall <[email protected]>

* Fix typo (#271)

* [PT] add chapter 6.1 (#273)

* add Japanese chapter7 (#267)

* zh-CN - Chapter 4,5finished

* replace `load_metric` with `evaluate.load` (#285)

* update `load_metric` refs to `evaluate.load`

Co-authored-by: lewtun <[email protected]>

* [GJ] Translation to Gujarati - Ch0 Setup (#287)

* [PT] add chapter 6.2 and 6.3 (#279)

* Fix formatting

* Debug formatting

* Debug FR formatting

* zh-CN - Chapter 4,5finished (#281)

Co-authored-by: Lewis Tunstall <[email protected]>

* Chapter 01 - Done [PT] #51 (#280)

Co-authored-by: Lewis Tunstall <[email protected]>

* tf_default_data_collator seems to have moved

* zh-CN - Chapter 6finished

* Revert "Merge branch 'huggingface:main' into main"

This reverts commit aebb46e12f9f87a4303f8bb4f0f2cf545eb83b21, reversing
changes made to 69187a3789e8d3d2d0de821ebe495f111d1cc73d.

* Revert "zh-CN - Chapter 6finished"

This reverts commit e69fce28d3a7b35b76c4f768a6cedf295b37d8c9.

* zh-CN - Chapter 6finished

* fix style

* undo bad commit

* Chapter5it (#278)

* added the italian translation for unit 1 chapter5

Co-authored-by: Leandro von Werra <[email protected]>

* Vietnamese translation (#293)

* Update .github/workflows/build_pr_documentation.yml

Co-authored-by: lewtun <[email protected]>

* Translate JP chapter 8 (#249)

* Italian translation - Chapter 8 (#272)

* Translation to Vietnamese - chapter 5 (#297)

* Add course contributors (#298)

* Add CourseFloatingBanner component

* DocNotebookDropdown -> CourseFloatingBanner

* Italian translation Ch 2/1, 2/2 (#300)

* Add contributors (#304)

* Add forum button (#306)

Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: Avishek Das <[email protected]>
Co-authored-by: Suteera  Seeha <[email protected]>
Co-authored-by: Suteera <[email protected]>
Co-authored-by: Saeed Choobani <[email protected]>
Co-authored-by: Fermin Ordaz <[email protected]>
Co-authored-by: Kerem Turgutlu <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Sebastian Sosa <[email protected]>
Co-authored-by: tanersekmen <[email protected]>
Co-authored-by: Victor Costa <[email protected]>
Co-authored-by: Camille Couturier <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Kavya <[email protected]>
Co-authored-by: Batuhan Ayhan <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: Kambiz Ghoorchian <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Diego Vargas <[email protected]>
Co-authored-by: Thomas O'Brien <[email protected]>
Co-authored-by: Lincoln V Schreiber <[email protected]>
Co-authored-by: Giorgio Severi <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Ömer Faruk Özdemir <[email protected]>
Co-authored-by: Caterina Bonan <[email protected]>
Co-authored-by: Hiromu Hota <[email protected]>
Co-authored-by: trtd56 <[email protected]>
Co-authored-by: Mehrdad Nezamdoost <[email protected]>
Co-authored-by: Wolvz <[email protected]>
Co-authored-by: a-krirk <[email protected]>
Co-authored-by: atgctg <[email protected]>
Co-authored-by: Thiago Medeiros <[email protected]>
Co-authored-by: webbigdata-jp <[email protected]>
Co-authored-by: Leandro von Werra <[email protected]>
Co-authored-by: Bhadresh Savani <[email protected]>
Co-authored-by: Andreas Ehrencrona <[email protected]>
Co-authored-by: leandro <[email protected]>
Co-authored-by: Matt <[email protected]>
Co-authored-by: Nolanogenn <[email protected]>
Co-authored-by: Hồng Hạnh <[email protected]>
Co-authored-by: Younes Belkada <[email protected]>
Co-authored-by: Edoardo Abati <[email protected]>
Co-authored-by: Mishig Davaadorj <[email protected]>
Co-authored-by: Acciaro Gennaro Daniele <[email protected]>

* Bump release (#307)

* Bump release (#308)

* Bump release (#314)

* Bump release (#320)

* Bump release (#328)

* Bump release (#333)

* Bump release (#335)

* Bump release (#343)

* Bump release (#355)

* Bump release (#358)

* Bump release (#371)

* Bump release (#381)

* Bump release (#387)

* Bump release (#404)

* Bump release (#413)

* Bump release (#426)

* Bump release (#463)

---------

Co-authored-by: DOOHAE JUNG <[email protected]>
Co-authored-by: m_khandaker <[email protected]>
Co-authored-by: Md. Al-Amin Khandaker <[email protected]>
Co-authored-by: ftarlaci <[email protected]>
Co-authored-by: Doohae Jung <[email protected]>
Co-authored-by: melaniedrevet <[email protected]>
Co-authored-by: Jose M Munoz <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Giyaseddin Bayrak <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Thomas Chaigneau <[email protected]>
Co-authored-by: ChainYo <[email protected]>
Co-authored-by: hiromu <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Marcus Fraaß <[email protected]>
Co-authored-by: Jesper Dramsch <[email protected]>
Co-authored-by: amyeroberts <[email protected]>
Co-authored-by: Ash <[email protected]>
Co-authored-by: Hamed Homaei Rad <[email protected]>
Co-authored-by: Dawood Khan <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: Avishek Das <[email protected]>
Co-authored-by: Suteera  Seeha <[email protected]>
Co-authored-by: Suteera <[email protected]>
Co-authored-by: Saeed Choobani <[email protected]>
Co-authored-by: Fermin Ordaz <[email protected]>
Co-authored-by: Kerem Turgutlu <[email protected]>
Co-authored-by: Sebastian Sosa <[email protected]>
Co-authored-by: tanersekmen <[email protected]>
Co-authored-by: Victor Costa <[email protected]>
Co-authored-by: Camille Couturier <[email protected]>
Co-authored-by: Kavya <[email protected]>
Co-authored-by: Batuhan Ayhan <[email protected]>
Co-authored-by: Kambiz Ghoorchian <[email protected]>
Co-authored-by: Diego Vargas <[email protected]>
Co-authored-by: Thomas O'Brien <[email protected]>
Co-authored-by: Lincoln V Schreiber <[email protected]>
Co-authored-by: Giorgio Severi <[email protected]>
Co-authored-by: Ömer Faruk Özdemir <[email protected]>
Co-authored-by: Caterina Bonan <[email protected]>
Co-authored-by: Hiromu Hota <[email protected]>
Co-authored-by: trtd56 <[email protected]>
Co-authored-by: Mehrdad Nezamdoost <[email protected]>
Co-authored-by: Wolvz <[email protected]>
Co-authored-by: a-krirk <[email protected]>
Co-authored-by: atgctg <[email protected]>
Co-authored-by: Thiago Medeiros <[email protected]>
Co-authored-by: webbigdata-jp <[email protected]>
Co-authored-by: Leandro von Werra <[email protected]>
Co-authored-by: Bhadresh Savani <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: Andreas Ehrencrona <[email protected]>
Co-authored-by: leandro <[email protected]>
Co-authored-by: Matt <[email protected]>
Co-authored-by: Nolanogenn <[email protected]>
Co-authored-by: Hồng Hạnh <[email protected]>
Co-authored-by: Younes Belkada <[email protected]>
Co-authored-by: Edoardo Abati <[email protected]>
Co-authored-by: Mishig Davaadorj <[email protected]>
Co-authored-by: Acciaro Gennaro Daniele <[email protected]>

* Revert "Bump release (#566)" (#567)

This reverts commit cccc2c91ac8e702e5e14bbb0419dbf0490c7aaaf.

* updated documentation links

* [doc build] Use secrets (#581)

* docs: fix broken links

* changed 'perspires' to 'persists' in chapter 1 quiz

solves issue #585

* Update 4.mdx

You forgot to write a return for this function.

* Update 4.mdx : Fix Typo

Should be "course"

* fix link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Update 2.mdx

updated loading datasets link

* Fix syntax in vi/chapter7/7.mdx

There was an unnecessary `</Tip>`

* Remove `get_lr()` from logs which refers to nonexistent function

`get_lr()` is called as part of this function, but the function is not declared anywhere in the script. This change removes this portion of the code since it is non-necessary.

* Update 4.mdx

removed judgmental argument

* Update en-version

* fix: remove useless token

* fix: remove useless token (#635)

* Translate Chapter 3 to Spanish (#510)

* translate Chapter 3 to Spanish

* translate code comments to Spanish and fix typos

* Translating Chapter 6 to Spanish (#523)

* Translating sections 1 and 2 to spanish

* Translating sections 3 to spanish

* Translating sections 3b to spanish

* Translating sections 4 to spanish

* Translating section 5 to spanish

* Translating section 6 to spanish

* Translating section 7 to spanish

* Translating section 8 to spanish

* Translating section 9 to spanish

* Translating section 10 to spanish

* Adding Sections to _toctree.yml

* Fixing Typos after second review

---------

Co-authored-by: datacubeR <[email protected]>

* Update 5.mdx

Ajuste na tradução de "encoders". São "codificadores", não "decodificadores". Decoders são "decodificadores".

* Update doc CI (#643)

* Фиксация текущих результатов.

* Фиксирую текущее состояние.

* Fixing the transfer results for today.

* Translated files 3b and partially 4. Fixing the result.

* Fixing today's translation.

* fix typos in Spanish translation (#511)

* Fixing today's translation. Files: 6.mdx, 7.mdx and half of 8.mdx.

* The translation of chapter 6 has been completed.

* Delete chapters/en/.ipynb_checkpoints/_toctree-checkpoint.yml

This is backup created by JupyterLab.

* Delete chapters/en/chapter5/.ipynb_checkpoints/8-checkpoint.mdx

This is backup created by JupyterLab.

* Delete chapters/en/chapter6/.ipynb_checkpoints/1-checkpoint.mdx

This is backup created by JupyterLab.

* Delete chapters/en/chapter6/.ipynb_checkpoints/2-checkpoint.mdx

This is backup created by JupyterLab.

* Delete chapters/en/chapter6/.ipynb_checkpoints/8-checkpoint.mdx

This is backup created by JupyterLab.

* Delete chapters/en/chapter6/.ipynb_checkpoints/9-checkpoint.mdx

This is backup created by JupyterLab.

* Delete chapters/ru/.ipynb_checkpoints/TRANSLATING-checkpoint.txt

This is backup created by JupyterLab.

* Delete chapters/ru/.ipynb_checkpoints/_toctree-checkpoint.yml

This is backup created by JupyterLab.

* Delete chapters/ru/chapter5/.ipynb_checkpoints/8-checkpoint.mdx

This is backup created by JupyterLab.

* Update 10.mdx

Minor fix.

* Update 10.mdx

Trying to solve the markup problem.

* Update 10.mdx

Correcting the syntax of some markup again)

* Update chapters/ru/chapter6/4.mdx

Yes, that space is redundant here. You're right about that.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/4.mdx

Extra space. I overlooked it. My mistake.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/3.mdx

There's an extra space here. You're right.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/3.mdx

There's an extra space here. You're right.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/3b.mdx

Yeah, there's no need for a space here.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/3.mdx

Co-authored-by: Maria Khalusova <[email protected]>

* Update 3.mdx

* Update 7.mdx

Translated the comments noted on the review.

* Update 3.mdx

Translated the missing comments in the code.

* Update chapters/ru/chapter6/3b.mdx

Yes, an extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter6/5.mdx

Minor fix.

Co-authored-by: Maria Khalusova <[email protected]>

* Completed the translation of the first part of Chapter 7 into Russian.

* After run python utils/code_formatter.py

* Update chapters/ru/chapter7/1.mdx

Extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/2.mdx

Extra space. I didn't notice.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/2.mdx

Extra space. I didn't notice.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/2.mdx

Yes, indeed, I ate the space bar)))))

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/5.mdx

There's that extra space again.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/5.mdx

There's that extra space again that I didn't notice.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/5.mdx

Extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update 5.mdx

Translated the missing comment.

* Update chapters/ru/chapter7/4.mdx

Extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update 2.mdx

Translated the missing comment in the code

* Update 2.mdx

Translated the missing sentence.

* Update 3.mdx

Translated the missing sentence.

* Update 3.mdx

I agree, it sounds more neutral that way.

* Update chapters/ru/chapter7/3.mdx

An unnecessary parenthesis.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/3.mdx

Also an option, but we've translated it as "карточка модели" a lot of places.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/3.mdx

Extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update 3.mdx

Translated the missing comment in the code.

* Update chapters/ru/chapter7/3.mdx

Extra sapce.

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/4.mdx

Extra space.

Co-authored-by: Maria Khalusova <[email protected]>

* Update 4.mdx

Translated the missing comment in the code.

* Update 5.mdx

Added and translated the missing sentence: "Since the collator expects a list of dicts, where each dict represents a single example in the dataset, we also need to wrangle the data into the expected format before passing it to the data collator:"

* Update 5.mdx

Edit the display of the table on the course page.

* fixed links to other chapters

* fixed links to chapters' intros

* I added myself to the Languages and translations table.

* Deleted unnecessary folder automatically created by JupyterLab.

* Fix links to HF docs

* Finalizing the translation of chapter 7.

* Update 6.mdx

Extra space

* Update 7.mdx

Extra space

* Update chapters/ru/chapter7/6.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/6.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/6.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/7.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/6.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/7.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/7.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/8.mdx

Correction of abbreviation - NLP

Co-authored-by: Maria Khalusova <[email protected]>

* Update 7.mdx

Translated the code commentary

* Update 6.mdx

Translated the missing sentence.

* Update chapters/ru/chapter7/7.mdx

Co-authored-by: Maria Khalusova <[email protected]>

* Update 6.mdx

* Update chapters/ru/chapter7/6.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/7.mdx

Correcting a link

Co-authored-by: Maria Khalusova <[email protected]>

* Update chapters/ru/chapter7/6.mdx

Co-authored-by: Maria Khalusova <[email protected]>

* Fix style

---------

Co-authored-by: researcher <[email protected]>
Co-authored-by: iCell <[email protected]>
Co-authored-by: simpleAI <[email protected]>
Co-authored-by: Luke Cheng <[email protected]>
Co-authored-by: Qi Zhang <[email protected]>
Co-authored-by: Tiezhen WANG <[email protected]>
Co-authored-by: Yuan <[email protected]>
Co-authored-by: FYJNEVERFOLLOWS <[email protected]>
Co-authored-by: zhangchaosd <[email protected]>
Co-authored-by: Kim Bo Geum <[email protected]>
Co-authored-by: TK Buristrakul <[email protected]>
Co-authored-by: Acciaro Gennaro Daniele <[email protected]>
Co-authored-by: Carlos Aguayo <[email protected]>
Co-authored-by: ateliershen <[email protected]>
Co-authored-by: Pavel Nesterov <[email protected]>
Co-authored-by: Artyom Boyko <[email protected]>
Co-authored-by: Kirill Milintsevich <[email protected]>
Co-authored-by: jybarnes21 <[email protected]>
Co-authored-by: gxy-gxy <[email protected]>
Co-authored-by: iLeGend <[email protected]>
Co-authored-by: sj <[email protected]>
Co-authored-by: Sureshkumar Thangavel <[email protected]>
Co-authored-by: Andrei Shirobokov <[email protected]>
Co-authored-by: Pranav <[email protected]>
Co-authored-by: Maria Khalusova <[email protected]>
Co-authored-by: DOOHAE JUNG <[email protected]>
Co-authored-by: m_khandaker <[email protected]>
Co-authored-by: Md. Al-Amin Khandaker <[email protected]>
Co-authored-by: ftarlaci <[email protected]>
Co-authored-by: Doohae Jung <[email protected]>
Co-authored-by: melaniedrevet <[email protected]>
Co-authored-by: Jose M Munoz <[email protected]>
Co-authored-by: svv73 <[email protected]>
Co-authored-by: Vedant Pandya <[email protected]>
Co-authored-by: Bahram Shamshiri <[email protected]>
Co-authored-by: Giyaseddin Bayrak <[email protected]>
Co-authored-by: Pavel <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: petrichor1122 <[email protected]>
Co-authored-by: zhlhyx <[email protected]>
Co-authored-by: João Gustavo A. Amorim <[email protected]>
Co-authored-by: lbourdois <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Thomas Chaigneau <[email protected]>
Co-authored-by: ChainYo <[email protected]>
Co-authored-by: hiromu <[email protected]>
Co-authored-by: Cherdsak Kingkan <[email protected]>
Co-authored-by: Marcus Fraaß <[email protected]>
Co-authored-by: Jesper Dramsch <[email protected]>
Co-authored-by: amyeroberts <[email protected]>
Co-authored-by: Ash <[email protected]>
Co-authored-by: Hamed Homaei Rad <[email protected]>
Co-authored-by: Dawood Khan <[email protected]>
Co-authored-by: regisss <[email protected]>
Co-authored-by: Avishek Das <[email protected]>
Co-authored-by: Suteera  Seeha <[email protected]>
Co-authored-by: Suteera <[email protected]>
Co-authored-by: Saeed Choobani <[email protected]>
Co-authored-by: Fermin Ordaz <[email protected]>
Co-authored-by: Kerem Turgutlu <[email protected]>
Co-authored-by: Sebastian Sosa <[email protected]>
Co-authored-by: tanersekmen <[email protected]>
Co-authored-by: Victor Costa <[email protected]>
Co-authored-by: Camille Couturier <[email protected]>
Co-authored-by: Kavya <[email protected]>
Co-authored-by: Batuhan Ayhan <[email protected]>
Co-authored-by: Kambiz Ghoorchian <[email protected]>
Co-authored-by: Diego Vargas <[email protected]>
Co-authored-by: Thomas O'Brien <[email protected]>
Co-authored-by: Lincoln V Schreiber <[email protected]>
Co-authored-by: Giorgio Severi <[email protected]>
Co-authored-by: Ömer Faruk Özdemir <[email protected]>
Co-authored-by: Caterina Bonan <[email protected]>
Co-authored-by: Hiromu Hota <[email protected]>
Co-authored-by: trtd56 <[email protected]>
Co-authored-by: Mehrdad Nezamdoost <[email protected]>
Co-authored-by: Wolvz <[email protected]>
Co-authored-by: a-krirk <[email protected]>
Co-authored-by: atgctg <[email protected]>
Co-authored-by: Thiago Medeiros <[email protected]>
Co-authored-by: webbigdata-jp <[email protected]>
Co-authored-by: Leandro von Werra <[email protected]>
Co-authored-by: Bhadresh Savani <[email protected]>
Co-authored-by: 1375626371 <[email protected]>
Co-authored-by: Andreas Ehrencrona <[email protected]>
Co-authored-by: leandro <[email protected]>
Co-authored-by: Matt <[email protected]>
Co-authored-by: Nolanogenn <[email protected]>
Co-authored-by: Hồng Hạnh <[email protected]>
Co-authored-by: Younes Belkada <[email protected]>
Co-authored-by: Edoardo Abati <[email protected]>
Co-authored-by: Mishig Davaadorj <[email protected]>
Co-authored-by: nnoboa <[email protected]>
Co-authored-by: Vipula Sandaruwan Dissanayake <[email protected]>
Co-authored-by: Alex Bzdel <[email protected]>
Co-authored-by: JieShen <[email protected]>
Co-authored-by: Hardik Bhadani <[email protected]>
Co-authored-by: Omar Sanseviero <[email protected]>
Co-authored-by: Suket Kamboj <[email protected]>
Co-authored-by: Brad Windsor <[email protected]>
Co-authored-by: Pierre Alexandre SCHEMBRI <[email protected]>
Co-authored-by: Remy <[email protected]>
Co-authored-by: María Grandury <[email protected]>
Co-authored-by: Alfonso Tobar-Arancibia <[email protected]>
Co-authored-by: datacubeR <[email protected]>
Co-authored-by: Alysson <[email protected]>
Co-authored-by: Merve N…
  • Loading branch information
Show file tree
Hide file tree
Showing 100 changed files with 6,846 additions and 219 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ This repo contains the content that's used to create the **[Hugging Face course]
| [Japanese](https://huggingface.co/course/ja/chapter1/1) (WIP) | [`chapters/ja`](https://github.com/huggingface/course/tree/main/chapters/ja) | [@hiromu166](https://github.com/@hiromu166), [@younesbelkada](https://github.com/@younesbelkada), [@HiromuHota](https://github.com/@HiromuHota) |
| [Korean](https://huggingface.co/course/ko/chapter1/1) (WIP) | [`chapters/ko`](https://github.com/huggingface/course/tree/main/chapters/ko) | [@Doohae](https://github.com/Doohae), [@wonhyeongseo](https://github.com/wonhyeongseo), [@dlfrnaos19](https://github.com/dlfrnaos19), [@nsbg](https://github.com/nsbg) |
| [Portuguese](https://huggingface.co/course/pt/chapter1/1) (WIP) | [`chapters/pt`](https://github.com/huggingface/course/tree/main/chapters/pt) | [@johnnv1](https://github.com/johnnv1), [@victorescosta](https://github.com/victorescosta), [@LincolnVS](https://github.com/LincolnVS) |
| [Russian](https://huggingface.co/course/ru/chapter1/1) (WIP) | [`chapters/ru`](https://github.com/huggingface/course/tree/main/chapters/ru) | [@pdumin](https://github.com/pdumin), [@svv73](https://github.com/svv73) |
| [Russian](https://huggingface.co/course/ru/chapter1/1) (WIP) | [`chapters/ru`](https://github.com/huggingface/course/tree/main/chapters/ru) | [@pdumin](https://github.com/pdumin), [@svv73](https://github.com/svv73), [@blademoon](https://github.com/blademoon) |
| [Thai](https://huggingface.co/course/th/chapter1/1) (WIP) | [`chapters/th`](https://github.com/huggingface/course/tree/main/chapters/th) | [@peeraponw](https://github.com/peeraponw), [@a-krirk](https://github.com/a-krirk), [@jomariya23156](https://github.com/jomariya23156), [@ckingkan](https://github.com/ckingkan) |
| [Turkish](https://huggingface.co/course/tr/chapter1/1) (WIP) | [`chapters/tr`](https://github.com/huggingface/course/tree/main/chapters/tr) | [@tanersekmen](https://github.com/tanersekmen), [@mertbozkir](https://github.com/mertbozkir), [@ftarlaci](https://github.com/ftarlaci), [@akkasayaz](https://github.com/akkasayaz) |
| [Vietnamese](https://huggingface.co/course/vi/chapter1/1) | [`chapters/vi`](https://github.com/huggingface/course/tree/main/chapters/vi) | [@honghanhh](https://github.com/honghanhh) |
Expand Down
2 changes: 1 addition & 1 deletion chapters/de/chapter1/3.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,7 @@ Wenn du einen Text an eine Pipeline übergibst, gibt es drei wichtige Schritte:
3. Die Vorhersagen des Modells werden so nachverarbeitet, sodass du sie nutzen kannst.


Einige der derzeit [verfügbaren Pipelines](https://huggingface.co/transformers/main_classes/pipelines.html) sind:
Einige der derzeit [verfügbaren Pipelines](https://huggingface.co/transformers/main_classes/pipelines) sind:

- `feature-extraction` (Vektordarstellung eines Textes erhalten)
- `fill-mask`
Expand Down
10 changes: 5 additions & 5 deletions chapters/de/chapter1/5.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -15,8 +15,8 @@ Rein Encoder-basierte Modelle eignen sich am besten für Aufgaben, die ein Verst

Zu dieser Modellfamilie gehören unter anderem:

- [ALBERT](https://huggingface.co/transformers/model_doc/albert.html)
- [BERT](https://huggingface.co/transformers/model_doc/bert.html)
- [DistilBERT](https://huggingface.co/transformers/model_doc/distilbert.html)
- [ELECTRA](https://huggingface.co/transformers/model_doc/electra.html)
- [RoBERTa](https://huggingface.co/transformers/model_doc/roberta.html)
- [ALBERT](https://huggingface.co/transformers/model_doc/albert)
- [BERT](https://huggingface.co/transformers/model_doc/bert)
- [DistilBERT](https://huggingface.co/transformers/model_doc/distilbert)
- [ELECTRA](https://huggingface.co/transformers/model_doc/electra)
- [RoBERTa](https://huggingface.co/transformers/model_doc/roberta)
6 changes: 3 additions & 3 deletions chapters/de/chapter1/6.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ Diese Modelle sind am besten für Aufgaben geeignet, bei denen es um die Generie

Zu dieser Modellfamilie gehören unter anderem:

- [CTRL](https://huggingface.co/transformers/model_doc/ctrl.html)
- [CTRL](https://huggingface.co/transformers/model_doc/ctrl)
- [GPT](https://huggingface.co/docs/transformers/model_doc/openai-gpt)
- [GPT-2](https://huggingface.co/transformers/model_doc/gpt2.html)
- [Transformer XL](https://huggingface.co/transformers/model_doc/transformerxl.html)
- [GPT-2](https://huggingface.co/transformers/model_doc/gpt2)
- [Transformer XL](https://huggingface.co/transformers/model_doc/transformerxl)
8 changes: 4 additions & 4 deletions chapters/de/chapter1/7.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ Sequence-to-Sequence-Modelle eignen sich am besten für Aufgaben, bei denen es d

Vertreter dieser Modellfamilie sind u. a.:

- [BART](https://huggingface.co/transformers/model_doc/bart.html)
- [mBART](https://huggingface.co/transformers/model_doc/mbart.html)
- [Marian](https://huggingface.co/transformers/model_doc/marian.html)
- [T5](https://huggingface.co/transformers/model_doc/t5.html)
- [BART](https://huggingface.co/transformers/model_doc/bart)
- [mBART](https://huggingface.co/transformers/model_doc/mbart)
- [Marian](https://huggingface.co/transformers/model_doc/marian)
- [T5](https://huggingface.co/transformers/model_doc/t5)
2 changes: 1 addition & 1 deletion chapters/de/chapter3/2.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -235,7 +235,7 @@ tokenized_dataset = tokenizer(

Das funktioniert gut, hat aber den Nachteil, dass ein Dictionary zurückgegeben wird (mit unseren Schlüsselwörtern `input_ids`, `attention_mask` und `token_type_ids` und Werten aus Listen von Listen). Es funktioniert auch nur, wenn du genügend RAM hast, um den gesamten Datensatz während der Tokenisierung zu im RAM zwischen zu speichern (während die Datensätze aus der Bibliothek 🤗 Datasets [Apache Arrow](https://arrow.apache.org/) Dateien sind, die auf der Festplatte gespeichert sind, sodass nur die gewünschten Samples im RAM geladen sind).

Um die Daten als Datensatz zu speichern, verwenden wir die Methode [`Dataset.map()`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasets.Dataset.map). Dies gewährt uns zusätzliche Flexibilität, wenn wir zusätzliche Vorverarbeitung als nur die Tokenisierung benötigen. Die `map()`-Methode funktioniert, indem sie eine Funktion auf jedes Element des Datensatzes anwendet, also definieren wir eine Funktion, die unsere Inputs tokenisiert:
Um die Daten als Datensatz zu speichern, verwenden wir die Methode [`Dataset.map()`](https://huggingface.co/docs/datasets/package_reference/main_classes#datasets.Dataset.map). Dies gewährt uns zusätzliche Flexibilität, wenn wir zusätzliche Vorverarbeitung als nur die Tokenisierung benötigen. Die `map()`-Methode funktioniert, indem sie eine Funktion auf jedes Element des Datensatzes anwendet, also definieren wir eine Funktion, die unsere Inputs tokenisiert:

```py
def tokenize_function(example):
Expand Down
4 changes: 2 additions & 2 deletions chapters/de/chapter4/2.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -65,7 +65,7 @@ tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = CamembertForMaskedLM.from_pretrained("camembert-base")
```

Dennoch empfehlen wir, dass man die [`Auto*` classes](https://huggingface.co/transformers/model_doc/auto.html?highlight=auto#auto-classes) stattdessen benutzt, da diese architekturunabhängig sind. Das vorherige Code-Beispiel gilt nur für Checkpoints, die in die CamemBERT Architektur zu laden sind, aber mit den `Auto*` Klassen kann man Checkpoints ziemlich einfach tauschen:
Dennoch empfehlen wir, dass man die [`Auto*` classes](https://huggingface.co/transformers/model_doc/auto?highlight=auto#auto-classes) stattdessen benutzt, da diese architekturunabhängig sind. Das vorherige Code-Beispiel gilt nur für Checkpoints, die in die CamemBERT Architektur zu laden sind, aber mit den `Auto*` Klassen kann man Checkpoints ziemlich einfach tauschen:

```py
from transformers import AutoTokenizer, AutoModelForMaskedLM
Expand All @@ -81,7 +81,7 @@ tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = TFCamembertForMaskedLM.from_pretrained("camembert-base")
```

Hier empfehlen wir auch, dass man stattdessen die [`TFAuto*` classes](https://huggingface.co/transformers/model_doc/auto.html?highlight=auto#auto-classes) benutzt, da diese architekturunabhängig sind. Das vorherige Code-Beispiel gilt nur für Checkpoints, die in die CamemBERT Architektur zu laden sind, aber mit den `TFAuto*` Klassen kann man Checkpoints einfach tauschen:
Hier empfehlen wir auch, dass man stattdessen die [`TFAuto*` classes](https://huggingface.co/transformers/model_doc/auto?highlight=auto#auto-classes) benutzt, da diese architekturunabhängig sind. Das vorherige Code-Beispiel gilt nur für Checkpoints, die in die CamemBERT Architektur zu laden sind, aber mit den `TFAuto*` Klassen kann man Checkpoints einfach tauschen:

```py
from transformers import AutoTokenizer, TFAutoModelForMaskedLM
Expand Down
2 changes: 1 addition & 1 deletion chapters/en/chapter1/3.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,7 @@ There are three main steps involved when you pass some text to a pipeline:
3. The predictions of the model are post-processed, so you can make sense of them.


Some of the currently [available pipelines](https://huggingface.co/transformers/main_classes/pipelines.html) are:
Some of the currently [available pipelines](https://huggingface.co/transformers/main_classes/pipelines) are:

- `feature-extraction` (get the vector representation of a text)
- `fill-mask`
Expand Down
6 changes: 3 additions & 3 deletions chapters/en/chapter1/6.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ These models are best suited for tasks involving text generation.

Representatives of this family of models include:

- [CTRL](https://huggingface.co/transformers/model_doc/ctrl.html)
- [CTRL](https://huggingface.co/transformers/model_doc/ctrl)
- [GPT](https://huggingface.co/docs/transformers/model_doc/openai-gpt)
- [GPT-2](https://huggingface.co/transformers/model_doc/gpt2.html)
- [Transformer XL](https://huggingface.co/transformers/model_doc/transfo-xl.html)
- [GPT-2](https://huggingface.co/transformers/model_doc/gpt2)
- [Transformer XL](https://huggingface.co/transformers/model_doc/transfo-xl)
8 changes: 4 additions & 4 deletions chapters/en/chapter1/7.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ Sequence-to-sequence models are best suited for tasks revolving around generatin

Representatives of this family of models include:

- [BART](https://huggingface.co/transformers/model_doc/bart.html)
- [mBART](https://huggingface.co/transformers/model_doc/mbart.html)
- [Marian](https://huggingface.co/transformers/model_doc/marian.html)
- [T5](https://huggingface.co/transformers/model_doc/t5.html)
- [BART](https://huggingface.co/transformers/model_doc/bart)
- [mBART](https://huggingface.co/transformers/model_doc/mbart)
- [Marian](https://huggingface.co/transformers/model_doc/marian)
- [T5](https://huggingface.co/transformers/model_doc/t5)
2 changes: 1 addition & 1 deletion chapters/en/chapter3/2.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -235,7 +235,7 @@ tokenized_dataset = tokenizer(

This works well, but it has the disadvantage of returning a dictionary (with our keys, `input_ids`, `attention_mask`, and `token_type_ids`, and values that are lists of lists). It will also only work if you have enough RAM to store your whole dataset during the tokenization (whereas the datasets from the 🤗 Datasets library are [Apache Arrow](https://arrow.apache.org/) files stored on the disk, so you only keep the samples you ask for loaded in memory).

To keep the data as a dataset, we will use the [`Dataset.map()`](https://huggingface.co/docs/datasets/package_reference/main_classes.html#datasets.Dataset.map) method. This also allows us some extra flexibility, if we need more preprocessing done than just tokenization. The `map()` method works by applying a function on each element of the dataset, so let's define a function that tokenizes our inputs:
To keep the data as a dataset, we will use the [`Dataset.map()`](https://huggingface.co/docs/datasets/package_reference/main_classes#datasets.Dataset.map) method. This also allows us some extra flexibility, if we need more preprocessing done than just tokenization. The `map()` method works by applying a function on each element of the dataset, so let's define a function that tokenizes our inputs:

```py
def tokenize_function(example):
Expand Down
4 changes: 2 additions & 2 deletions chapters/en/chapter4/2.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -65,7 +65,7 @@ tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = CamembertForMaskedLM.from_pretrained("camembert-base")
```

However, we recommend using the [`Auto*` classes](https://huggingface.co/transformers/model_doc/auto.html?highlight=auto#auto-classes) instead, as these are by design architecture-agnostic. While the previous code sample limits users to checkpoints loadable in the CamemBERT architecture, using the `Auto*` classes makes switching checkpoints simple:
However, we recommend using the [`Auto*` classes](https://huggingface.co/transformers/model_doc/auto?highlight=auto#auto-classes) instead, as these are by design architecture-agnostic. While the previous code sample limits users to checkpoints loadable in the CamemBERT architecture, using the `Auto*` classes makes switching checkpoints simple:

```py
from transformers import AutoTokenizer, AutoModelForMaskedLM
Expand All @@ -81,7 +81,7 @@ tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = TFCamembertForMaskedLM.from_pretrained("camembert-base")
```

However, we recommend using the [`TFAuto*` classes](https://huggingface.co/transformers/model_doc/auto.html?highlight=auto#auto-classes) instead, as these are by design architecture-agnostic. While the previous code sample limits users to checkpoints loadable in the CamemBERT architecture, using the `TFAuto*` classes makes switching checkpoints simple:
However, we recommend using the [`TFAuto*` classes](https://huggingface.co/transformers/model_doc/auto?highlight=auto#auto-classes) instead, as these are by design architecture-agnostic. While the previous code sample limits users to checkpoints loadable in the CamemBERT architecture, using the `TFAuto*` classes makes switching checkpoints simple:

```py
from transformers import AutoTokenizer, TFAutoModelForMaskedLM
Expand Down
2 changes: 1 addition & 1 deletion chapters/en/chapter4/3.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -178,7 +178,7 @@ Click on the "Files and versions" tab, and you should see the files visible in t

</Tip>

As you've seen, the `push_to_hub()` method accepts several arguments, making it possible to upload to a specific repository or organization namespace, or to use a different API token. We recommend you take a look at the method specification available directly in the [🤗 Transformers documentation](https://huggingface.co/transformers/model_sharing.html) to get an idea of what is possible.
As you've seen, the `push_to_hub()` method accepts several arguments, making it possible to upload to a specific repository or organization namespace, or to use a different API token. We recommend you take a look at the method specification available directly in the [🤗 Transformers documentation](https://huggingface.co/transformers/model_sharing) to get an idea of what is possible.

The `push_to_hub()` method is backed by the [`huggingface_hub`](https://github.com/huggingface/huggingface_hub) Python package, which offers a direct API to the Hugging Face Hub. It's integrated within 🤗 Transformers and several other machine learning libraries, like [`allenlp`](https://github.com/allenai/allennlp). Although we focus on the 🤗 Transformers integration in this chapter, integrating it into your own code or library is simple.

Expand Down
4 changes: 2 additions & 2 deletions chapters/en/chapter5/2.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -128,7 +128,7 @@ This is exactly what we wanted. Now, we can apply various preprocessing techniqu

<Tip>

The `data_files` argument of the `load_dataset()` function is quite flexible and can be either a single file path, a list of file paths, or a dictionary that maps split names to file paths. You can also glob files that match a specified pattern according to the rules used by the Unix shell (e.g., you can glob all the JSON files in a directory as a single split by setting `data_files="*.json"`). See the 🤗 Datasets [documentation](https://huggingface.co/docs/datasets/loading.html#local-and-remote-files) for more details.
The `data_files` argument of the `load_dataset()` function is quite flexible and can be either a single file path, a list of file paths, or a dictionary that maps split names to file paths. You can also glob files that match a specified pattern according to the rules used by the Unix shell (e.g., you can glob all the JSON files in a directory as a single split by setting `data_files="*.json"`). See the 🤗 Datasets [documentation](https://huggingface.co/docs/datasets/loading#local-and-remote-files) for more details.

</Tip>

Expand Down Expand Up @@ -160,7 +160,7 @@ This returns the same `DatasetDict` object obtained above, but saves us the step

<Tip>

✏️ **Try it out!** Pick another dataset hosted on GitHub or the [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php) and try loading it both locally and remotely using the techniques introduced above. For bonus points, try loading a dataset that’s stored in a CSV or text format (see the [documentation](https://huggingface.co/docs/datasets/loading.html#local-and-remote-files) for more information on these formats).
✏️ **Try it out!** Pick another dataset hosted on GitHub or the [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php) and try loading it both locally and remotely using the techniques introduced above. For bonus points, try loading a dataset that’s stored in a CSV or text format (see the [documentation](https://huggingface.co/docs/datasets/loading#local-and-remote-files) for more information on these formats).

</Tip>

Expand Down
Loading

0 comments on commit b7b8471

Please sign in to comment.