fix: DIA-1523: test coverage for LabelStudioSkill #247

matt-bernstein · 2024-11-05T17:41:48Z

Fuzz testing using a bunch of different label configs and models and making sure predictions are always valid

Uncovered and fixed 2 bugs:

NER postprocessing was only done for the first tag in a label config
text field failed to generate sometimes for NER entities

…verage

matt-bernstein · 2024-11-05T17:46:21Z

black made a bunch of formatting changes, ignore the ones in existing tests, there's just one big new test at the bottom

codecov-commenter · 2024-11-06T18:54:13Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 66.33%. Comparing base (882ca68) to head (64ab9b3).

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #247      +/-   ##
==========================================
+ Coverage   65.71%   66.33%   +0.62%     
==========================================
  Files          47       47              
  Lines        2424     2439      +15     
==========================================
+ Hits         1593     1618      +25     
+ Misses        831      821      -10

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tests/test_label_studio_skill.py

adala/skills/collection/label_studio.py

pakelley · 2024-11-06T21:59:19Z

adala/skills/collection/label_studio.py

                input_field_name = ner_tag.objects[0].value.lstrip('$')
                output_field_name = ner_tag.name
                quote_string_field_name = 'text'
-                output = extract_indices(pd.concat([input, output], axis=1), input_field_name, output_field_name, quote_string_field_name)
+                df = pd.concat([input, output], axis=1)
+                output = validate_output_format_for_ner_tag(df, input_field_name, output_field_name)


nice, now we can take out this additional call to validate_output_format_for_ner_tag, but other than that lgtm 👍

Still need it, I added the call to EntityExtraction.extract_indices not the standalone extract_indices

matt-bernstein added 3 commits November 4, 2024 10:36

feat: DIA-1523: test coverage for valid model output for multiskill

a4a2619

Merge remote-tracking branch 'origin/master' into fb-dia-1523-test-co…

3f556f6

…verage

add tests for choices and labels

33729d4

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 5, 2024 17:43 Destroyed

matt-bernstein added 2 commits November 5, 2024 17:31

update test

bc7ab77

fix ner inference

73e792c

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 5, 2024 22:33 Destroyed

bugfix: check ner tags beyond the first one

329bcb2

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 5, 2024 22:58 Destroyed

matt-bernstein added 3 commits November 5, 2024 18:18

don't leak task data in warning logs

7a365bf

add textarea test coverage

6462018

fix test

06ad709

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 6, 2024 00:33 Destroyed

robot-ci-heartex marked this pull request as draft November 6, 2024 11:05

put under vcr

64ab9b3

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 6, 2024 18:51 Destroyed

matt-bernstein marked this pull request as ready for review November 6, 2024 18:52

matt-bernstein changed the title ~~test coverage for LabelStudioSkill~~ fix: DIA-1523: test coverage for LabelStudioSkill Nov 6, 2024

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 6, 2024 18:54 Destroyed

pakelley reviewed Nov 6, 2024

View reviewed changes

tests/test_label_studio_skill.py Outdated Show resolved Hide resolved

adala/skills/collection/label_studio.py Show resolved Hide resolved

matt-bernstein added 2 commits November 6, 2024 14:57

move imports

01c8b8e

apply ner bugfix to legacy skill too

f40c449

matt-bernstein requested a review from pakelley November 6, 2024 19:59

robot-ci-heartex temporarily deployed to fb-dia-1523-test-coverage November 6, 2024 20:00 Destroyed

pakelley reviewed Nov 6, 2024

View reviewed changes

pakelley approved these changes Nov 6, 2024

View reviewed changes

robot-ci-heartex marked this pull request as draft November 7, 2024 08:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: DIA-1523: test coverage for LabelStudioSkill #247

fix: DIA-1523: test coverage for LabelStudioSkill #247

matt-bernstein commented Nov 5, 2024 •

edited

Loading

matt-bernstein commented Nov 5, 2024

codecov-commenter commented Nov 6, 2024

pakelley Nov 6, 2024

matt-bernstein Nov 6, 2024

fix: DIA-1523: test coverage for LabelStudioSkill #247

Are you sure you want to change the base?

fix: DIA-1523: test coverage for LabelStudioSkill #247

Conversation

matt-bernstein commented Nov 5, 2024 • edited Loading

matt-bernstein commented Nov 5, 2024

codecov-commenter commented Nov 6, 2024

Codecov Report

pakelley Nov 6, 2024

Choose a reason for hiding this comment

matt-bernstein Nov 6, 2024

Choose a reason for hiding this comment

matt-bernstein commented Nov 5, 2024 •

edited

Loading