
experimental result #3

Open
wsa-dot opened this issue Jun 7, 2022 · 8 comments

@wsa-dot

wsa-dot commented Jun 7, 2022

The result I got was only 65. I don't know what was wrong.

@Linda230

Hello,
I also got the same result on the STS task, and I also don't know the reason.

@Linda230

The result I got was only 65. I don't know what was wrong.

Hi,
Did you find the reason for this result?

@wsa-dot
Author

wsa-dot commented Jun 10, 2022 via email

@Linda230

Maybe he is only good at theoretical analysis, and his hybrid method may not be really effective in practice. We need to come up with new ways to create genuinely useful hard negatives.


Yes, the theoretical analysis is very valuable, but then I'm curious how the results in the paper were obtained: I basically followed the README to reproduce them, yet my numbers have a large gap from the paper's results.
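
For context, here is my understanding of what the --lambdas flag (shown in the training command below) controls. As I read the paper, each mixed hard negative is a convex combination of a positive feature and a random negative feature, re-normalized and detached. A rough sketch of that idea (my reading of the method, not this repo's exact code):

import torch
import torch.nn.functional as F

def mix_hard_negative(h_pos, h_neg, lam=0.2):
    # Convex combination of a positive and a random negative feature
    # (lam corresponds to the --lambdas training flag).
    mixed = lam * h_pos + (1.0 - lam) * h_neg
    # Re-normalize onto the unit sphere, since features are compared by cosine.
    mixed = F.normalize(mixed, dim=-1)
    # Stop-gradient: the mixed vector is used only as a negative target.
    return mixed.detach()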

@afalf

afalf commented Jun 12, 2022

The result I got was only 65. I don't know what was wrong.

Sorry, I have just seen it. Could you please show your hyperparameters for training?

@Linda230

The result I got was only 65. I don't know what was wrong.

Sorry, I have just seen it. Could you please show your hyperparameters for training?

Hi, thank you for your reply. Here is my hyperparameter setting:

python train.py \
    --model_name_or_path bert-base-uncased \
    --train_file data/wiki1m_for_simcse.txt \
    --eval_path data/sts-dev.tsv \
    --output_dir $MODEL_PATH \
    --num_train_epochs 1 \
    --per_device_train_batch_size 64 \
    --learning_rate 3e-5 \
    --max_seq_length 32 \
    --evaluation_strategy steps \
    --metric_for_best_model stsb_spearman \
    --load_best_model_at_end \
    --eval_steps 125 \
    --pooler_type cls \
    --overwrite_output_dir \
    --temp 0.05 \
    --do_train \
    --do_eval \
    --seed 42 \
    --lambdas 0.6
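
For reference, the stsb_spearman metric used above for model selection is the Spearman rank correlation between the cosine similarities of the embedded sentence pairs and the gold STS scores. A small illustrative sketch of that computation (not this repo's exact code):

import numpy as np
from scipy.stats import spearmanr

def stsb_spearman(emb_a, emb_b, gold_scores):
    # Cosine similarity for each embedded sentence pair.
    a = emb_a / np.linalg.norm(emb_a, axis=1, keepdims=True)
    b = emb_b / np.linalg.norm(emb_b, axis=1, keepdims=True)
    cos = (a * b).sum(axis=1)
    # Spearman rank correlation against the gold similarity scores.
    return spearmanr(cos, gold_scores).correlation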

@Linda230

The result I got was only 65. I don't know what was wrong.

Sorry, I have just seen it. Could you please show your hyperparameters for training?

Hello, thanks for your reply. I set lambda = 0.2 as in the paper, and got an average STS = 77.20 using "cls" pooling and a higher result, STS = 77.90, using "cls_before_pooler". But I think I should follow your README and adopt "cls" pooling, right?

@zyznull
Collaborator

zyznull commented Jun 14, 2022

The result I got was only 65. I don't know what was wrong.

Sorry, I have just seen it. Could you please show your hyperparameters for training?

Hello, thanks for your reply. I set lambda = 0.2 as in the paper, and got an average STS = 77.20 using "cls" pooling and a higher result, STS = 77.90, using "cls_before_pooler". But I think I should follow your README and adopt "cls" pooling, right?

Yeah, I find "cls" pooling is more robust, and the script has now been updated. Thank you for the reminder.
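
For anyone unsure about the difference between the two options: "cls_before_pooler" takes the raw [CLS] hidden state, while "cls" additionally passes it through a small MLP head trained with the model. A minimal sketch in the style of the SimCSE pooler this repo builds on (illustrative names, not the exact classes):

import torch.nn as nn

class MLPLayer(nn.Module):
    # Linear + tanh head applied on top of [CLS] when pooler_type == "cls".
    def __init__(self, hidden_size):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.activation = nn.Tanh()

    def forward(self, x):
        return self.activation(self.dense(x))

def pool(last_hidden_state, pooler_type, mlp):
    cls_vec = last_hidden_state[:, 0]  # raw [CLS] token embedding
    if pooler_type == "cls_before_pooler":
        return cls_vec                 # skip the trained MLP head
    elif pooler_type == "cls":
        return mlp(cls_vec)            # pass [CLS] through the MLP head
    raise ValueError(pooler_type)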
