Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add tensor parallelism for RWKV #1237

Open
wants to merge 30 commits into
base: main
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
4c7cb11
inital tp commits
jahatef Jun 4, 2024
46904d5
setup
jahatef Jun 19, 2024
e2933ef
configs
jahatef Sep 25, 2024
d1112ab
merge
jahatef Oct 3, 2024
43d641d
time mixing tp
jahatef Oct 3, 2024
de02f37
time-mixing
jahatef Oct 11, 2024
dd441b6
time mixing debugging
jahatef Oct 12, 2024
a418670
reset time_faaaa
jahatef Oct 13, 2024
540d856
Add additional asserts and update post training readme (#1300)
AI-WAIFU Oct 8, 2024
12aac35
Fix failling tests (#1301)
AI-WAIFU Oct 8, 2024
97c7915
inital tp commits
jahatef Jun 4, 2024
5f89ed8
merge
jahatef Nov 5, 2024
91cb759
Add ERROR logging prefix and sort the prefixes alphabetically (#1308)
TheBatmanofButler Oct 17, 2024
49b263a
inital tp commits
jahatef Jun 4, 2024
48de682
cleanup
jahatef Nov 6, 2024
c6fac96
cleanup
jahatef Nov 6, 2024
5a259c0
Update local_setup.yml
jahatef Nov 6, 2024
c2d6c85
add Triton FLA
jahatef Nov 10, 2024
bdb3658
change version of rwkv-fla
jahatef Nov 12, 2024
ff7f328
fix a GQA issue (#1314) (#1315)
tiandeyu-cs Nov 13, 2024
1350b2c
fix 'intermediate_size' in Llama configuration files after the 'mlp_t…
tiandeyu-cs Nov 13, 2024
c4d7a54
Python 3.10 support (#1313)
markNZed Nov 13, 2024
ee2f142
Fix documentation for converting SFT/DPO weights back to HF Llama (#1…
jacobthebanana Nov 13, 2024
6e81f0b
fix bug (#1311)
AI-WAIFU Nov 13, 2024
df95419
Add support for dropout in sparse attention (#1312)
michaelc-yu Nov 16, 2024
d682529
adds pyproject files and tests (#1302)
LouisCastricato Nov 16, 2024
0bc11d6
undo merge error (#1325)
Quentin-Anthony Nov 27, 2024
c6db95c
inital tp commits
jahatef Jun 4, 2024
daac503
setup
jahatef Jun 19, 2024
bf478ce
Merge branch 'main' into rwkv-tp
Quentin-Anthony Dec 19, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Add ERROR logging prefix and sort the prefixes alphabetically (#1308)
* Add ERROR logging prefix and sort alphabetically

* fix comment
  • Loading branch information
TheBatmanofButler authored and jahatef committed Nov 5, 2024
commit 91cb7590531fde52b68bc952571b6bc8ca0b5ea8
13 changes: 8 additions & 5 deletions megatron/neox_arguments/arguments.py
Original file line number Diff line number Diff line change
@@ -50,16 +50,19 @@
ATTENTION_TYPE_CHOICES,
)

### Logging colors ###
### ANSI escape codes ###
END = "\033[0m"
GREEN = "\033[92m"
RED = "\033[91m"
YELLOW = "\033[93m"
END = "\033[0m"
SUCCESS = f"{GREEN} [SUCCESS] {END}"
OKAY = f"{GREEN}[OKAY]{END}"
WARNING = f"{YELLOW}[WARNING]{END}"

### Formatted logging prefixes ###
ERROR = f"{RED}[ERROR]{END} "
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think we've properly merged this branch onto upstream main, since this is tracking as a change. Please do this.

FAIL = f"{RED}[FAIL]{END}"
INFO = "[INFO]"
OKAY = f"{GREEN}[OKAY]{END}"
SUCCESS = f"{GREEN} [SUCCESS] {END}"
WARNING = f"{YELLOW}[WARNING]{END}"

# ZERO defaults by deespeed
# These values should not be changed unless defaults in deepspeed are changed