Support Pytorch MaxP Feature/ptmaxp #184
Conversation
…; remove unnecessary sort
MSMARCO reproduction logs - nima
repro commit for both tf and pt: 56ccaeb to db5e1ee
# REF-TODO: save scheduler state along with optimizer
# self.lr_scheduler.step()
# hacky: use the step instead of the internally calculated epoch to support step-wise lr updates
self.lr_scheduler.step(epoch=cur_step)
It's a bit hacky here: by default, lr_scheduler.step takes in the epoch. We change it because when we pass epoch=0 into our lr_multiplier and the warmup iteration count is also 1, the lr would be almost 0 for the entire first epoch.
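A minimal pure-Python sketch of why this matters (`lr_multiplier` and the warmup horizon here are hypothetical stand-ins, not Capreolus's actual implementation): if the scheduler is stepped per epoch with a 1-epoch warmup, the multiplier at epoch 0 is 0 and the lr stays near zero for the whole first epoch, whereas stepping with the global step ramps the lr up within the first epoch.

```python
base_lr = 1e-3


def lr_multiplier(unit, warmup):
    """Linear warmup to 1.0 over `warmup` units (hypothetical helper)."""
    return min(1.0, unit / warmup)


# Epoch-wise stepping with warmup == 1 epoch: the multiplier at epoch 0
# is 0, so the lr is effectively 0 for the entire first epoch.
first_epoch_lr = base_lr * lr_multiplier(0, warmup=1)  # -> 0.0

# Step-wise stepping (passing cur_step, with warmup measured in steps)
# ramps the lr up smoothly inside the first epoch instead.
step_lrs = [base_lr * lr_multiplier(s, warmup=1000) for s in (0, 250, 500, 1000)]
# -> [0.0, 0.00025, 0.0005, 0.001]
```

This is why the trainer calls `self.lr_scheduler.step(epoch=cur_step)` with the step counter rather than letting the scheduler advance once per epoch.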
@@ -222,6 +207,30 @@ def parse_label_tensor(x):
    label = tf.map_fn(parse_label_tensor, parsed_example["label"], dtype=tf.float32)

    return (pos_bert_input, pos_mask, pos_seg, neg_bert_input, neg_mask, neg_seg), label

def _filter_inputs(self, bert_inputs, bert_masks, bert_segs, n_valid_psg):
Explicitly for training, this function randomly selects one passage from the n passages. This is done in the extractor now so that the PyTorch and TensorFlow trainers can both use it.
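A minimal sketch of what such a filter could look like (the argument shapes and the `filter_inputs` name here are illustrative assumptions; the real `_filter_inputs` lives in the extractor shown in the diff above):

```python
import random


def filter_inputs(bert_inputs, bert_masks, bert_segs, n_valid_psg):
    """Randomly keep one of the first `n_valid_psg` passages (sketch).

    Each argument is a list with one entry per passage slot; only the
    first `n_valid_psg` entries correspond to real (non-padding) passages.
    """
    idx = random.randrange(n_valid_psg)  # sample among valid passages only
    return bert_inputs[idx], bert_masks[idx], bert_segs[idx]


# usage: 4 passage slots, 2 of them valid (the rest are padding)
inputs = [[101, 7592], [101, 2088], [0, 0], [0, 0]]
masks = [[1, 1], [1, 1], [0, 0], [0, 0]]
segs = [[0, 0], [0, 0], [0, 0], [0, 0]]
inp, mask, seg = filter_inputs(inputs, masks, segs, n_valid_psg=2)
```

Because the sampling happens at the extractor level, both trainers receive a single passage per query and need no framework-specific filtering logic.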
…ain_feature into two MixIns (depending on whether they generate a list of passages or a single passage per query at training time), so that they can be shared by each extractor as needed
This pull request introduces 9 alerts when merging db0e405 into a568304 - view on LGTM.com
@Extractor.register
class BirchBertPassage(MultipleTrainingPassagesMixin, BertPassage):
Inherit create_train_features and parse_train_features from MultipleTrainingPassagesMixin, and the other functions from BertPassage.
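A minimal sketch of how Python's method resolution order makes this work (simplified stand-ins for illustration, not the real Capreolus classes or their actual method bodies):

```python
class BertPassage:
    """Base extractor: single-passage training features (stand-in)."""

    def create_train_features(self):
        return "single-passage features"

    def id2vec(self):
        return "shared id2vec"


class MultipleTrainingPassagesMixin:
    """Mixin that overrides only the training-feature methods (stand-in)."""

    def create_train_features(self):
        return "multi-passage features"

    def parse_train_features(self):
        return "multi-passage parsing"


class BirchBertPassage(MultipleTrainingPassagesMixin, BertPassage):
    # Listing the mixin first means its methods shadow BertPassage's
    # in the MRO, while everything else falls through to BertPassage.
    pass


b = BirchBertPassage()
```

Here `b.create_train_features()` resolves to the mixin's version, while `b.id2vec()` still comes from `BertPassage`, which is exactly the split the class definition above relies on.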
I got a reasonable dev MRR with pytorch: 0.3548
id2vec function, so that it's compatible with both tf-maxp and pt-maxp