
Fix Inconsistent NER Grouping (Pipeline) #4987

Merged: 22 commits, Jul 8, 2020

Conversation

@enzoampil (Contributor) commented Jun 14, 2020

This PR solves issue #4816 by:

  1. Applying entity grouping to entities of the same type that carry different prefixes (i.e. B- and I-)
  2. Ensuring that a separate entity at the last filtered index is no longer excluded from grouping.

Running the sample script below (based on reference issue #4816) returns the expected results. Do note that the entity_group is based on the entity_type of the first entity in the group.

from transformers import pipeline
NER_MODEL = "mrm8488/bert-spanish-cased-finetuned-ner"
nlp_ner = pipeline("ner", model=NER_MODEL,
                   grouped_entities=True,
                   tokenizer=(NER_MODEL, {"use_fast": False}))

t = """Consuelo Araújo Noguera, ministra de cultura del presidente Andrés Pastrana (1998.2002) fue asesinada por las Farc luego de haber permanecido secuestrada por algunos meses."""
nlp_ner(t)

[{'entity_group': 'B-PER', 'score': 0.9710702640669686, 'word': 'Consuelo Araújo Noguera'},
 {'entity_group': 'B-PER', 'score': 0.9997273534536362, 'word': 'Andrés Pastrana'},
 {'entity_group': 'B-ORG', 'score': 0.8589080572128296, 'word': 'Farc'}]

I also ran a second test to confirm that fix number 2 (a separate entity at the last index) works properly, and it now does:

nlp = pipeline('ner', grouped_entities=False)
nlp("Enzo works at the the UN")
[{'entity': 'I-PER', 'index': 1, 'score': 0.9968166351318359, 'word': 'En'},
 {'entity': 'I-PER', 'index': 2, 'score': 0.9957635998725891, 'word': '##zo'},
 {'entity': 'I-ORG', 'index': 7, 'score': 0.9986497163772583, 'word': 'UN'}]

nlp2 = pipeline('ner', grouped_entities=True)
nlp2("Enzo works at the the UN")
[{'entity_group': 'I-PER', 'score': 0.9962901175022125, 'word': 'Enzo'},
 {'entity_group': 'I-ORG', 'score': 0.9986497163772583, 'word': 'UN'}]
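As a quick sanity check on the numbers above, the grouped score appears to be the mean of the member token scores (an observation from the figures in this PR, not a documented guarantee):

```python
# Check that the grouped "Enzo" score equals the mean of its token scores.
# Assumes (from the figures above) that grouping averages member scores.
en_score = 0.9968166351318359       # "En"
zo_score = 0.9957635998725891       # "##zo"
grouped_score = 0.9962901175022125  # "Enzo" in the grouped output

assert abs((en_score + zo_score) / 2 - grouped_score) < 1e-12
```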

You can test these out yourself in this colab notebook.

cc @dav009 @mfuntowicz

codecov bot commented Jun 14, 2020

Codecov Report

Merging #4987 into master will decrease coverage by 1.40%.
The diff coverage is 91.66%.


@@            Coverage Diff             @@
##           master    #4987      +/-   ##
==========================================
- Coverage   77.83%   76.43%   -1.41%     
==========================================
  Files         141      141              
  Lines       24634    24638       +4     
==========================================
- Hits        19175    18832     -343     
- Misses       5459     5806     +347     
Impacted Files Coverage Δ
src/transformers/pipelines.py 76.16% <91.66%> (+0.16%) ⬆️
src/transformers/modeling_tf_mobilebert.py 23.62% <0.00%> (-73.11%) ⬇️
src/transformers/modeling_tf_electra.py 26.92% <0.00%> (-68.47%) ⬇️
src/transformers/tokenization_roberta.py 76.71% <0.00%> (-21.92%) ⬇️
src/transformers/tokenization_utils_base.py 85.75% <0.00%> (-7.85%) ⬇️
src/transformers/tokenization_transfo_xl.py 38.73% <0.00%> (-3.76%) ⬇️
src/transformers/tokenization_utils_fast.py 92.02% <0.00%> (-2.18%) ⬇️
src/transformers/tokenization_openai.py 82.30% <0.00%> (-1.54%) ⬇️
src/transformers/tokenization_bert.py 90.86% <0.00%> (-0.46%) ⬇️
src/transformers/tokenization_utils.py 89.55% <0.00%> (-0.41%) ⬇️
... and 4 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@julien-c (Member)

Looks great, but this sort of code/feature also looks like a perfect candidate for more unit-testing coverage.

What do you think?

@enzoampil (Author)

Agree, can add these as test cases in test_pipelines.

@LysandreJik (Member)

That would be great @enzoampil!

@enzoampil (Author) commented Jun 17, 2020

@julien-c I've added the original issue bug as a test case (Number 1 in the original post). Do note that I only included it in the torch version because mrm8488/bert-spanish-cased-finetuned-ner seems to only work for torch. Please let me know if this is enough for this PR.

For future PRs that add test cases from issues found on top of this one (e.g. those from issue #5077), I was hoping to get some guidance on how we'd add them to the test coverage without making it too heavy. For context, different cases are typically based on different models, which means we'd have to run separate models to add them as test cases.

@julien-c (Member)

I think we should try to make the tests more unitary, meaning for instance that you would feed them fixed model outputs (no actual forward pass) and check that the resulting formatted output is correct.

This might require splitting the __call__ method into smaller, more testable functions, which is totally fine IMO.

@enzoampil (Author)

I see what you mean. Yes, that makes more sense than running different models. Will work on this.

@probavee (Contributor) left a comment

Thank you, I was looking for this!
It worked with a Camembert model with 8 different entities, but only when the text actually contains entities; otherwise it raises an IndexError.

---------------------------------------------------------------------------

IndexError                                Traceback (most recent call last)

<ipython-input-65-2315ccc0d111> in <module>()
      1 seq6 = "De nombreux particuliers s’interrogent sur la meilleure manière de se dessaisir d’un véhicule accidenté."
----> 2 ner(seq6)

/usr/local/lib/python3.6/dist-packages/transformers/pipelines.py in __call__(self, *args, **kwargs)
    948                 if self.model.config.id2label[label_idx] not in self.ignore_labels
    949             ]
--> 950             last_idx, _ = filtered_labels_idx[-1]
    951 
    952             for idx, label_idx in filtered_labels_idx:

IndexError: list index out of range

It runs by setting ignore_labels=[] in the pipeline instance.

[{'entity_group': 'O',
  'score': 0.969656412601471,
  'word': '<s> De nombreux particuliers s’interrogent sur la meilleure manière de se dessaisir d’un véhicule accidenté.</s>'}]

So I suggest adding a small guard condition.
Hope this helps!
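A minimal, self-contained sketch of the suggested guard (the names mirror the traceback above but are hypothetical, not the actual pipeline source):

```python
# Sketch of the guard: bail out before indexing filtered[-1] when every
# label was filtered away (e.g. a sentence with no entities while "O" is
# in ignore_labels). Names mirror the traceback above but are hypothetical.

def filter_label_indices(labels_idx, id2label, ignore_labels):
    filtered = [
        (idx, label_idx)
        for idx, label_idx in enumerate(labels_idx)
        if id2label[label_idx] not in ignore_labels
    ]
    if not filtered:          # the suggested condition
        return None, []
    last_idx, _ = filtered[-1]
    return last_idx, filtered

id2label = {0: "O", 1: "I-PER"}
print(filter_label_indices([0, 0, 0], id2label, ["O"]))  # (None, [])
print(filter_label_indices([0, 1, 0], id2label, ["O"]))  # (1, [(1, 1)])
```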

Review comment on src/transformers/pipelines.py (outdated, resolved)
@enzoampil (Author) commented Jul 4, 2020

@julien-c @LysandreJik I've performed the following adjustments to the PR:

  1. I've separated the group_entities function from the raw NER forward pass altogether, so that it's easy to run tests that feed fixed model outputs and check that the actual formatted output is correct.

group_entities now takes a list[dict] of raw NER model outputs as an argument and converts them to the grouped equivalent.

  2. I've added a new NerPipelineTests class in test_pipelines, which contains all the NER-related tests and includes new tests for the group_entities function.

The test simply confirms that the expected formatted (grouped) output is equivalent to the actual formatted output given the raw model outputs. For the test cases, I used the two samples from the original PR post. It should be straightforward to continue adding test cases moving forward.
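A minimal sketch of this fixture-driven style of test, feeding the fixed raw outputs from the second example in the PR description (the helper below is hypothetical, not the actual group_entities in pipelines.py, and word joining is simplified to sub-word merging):

```python
# Hypothetical sketch of entity grouping over fixed model outputs -- no
# forward pass required. Consecutive tokens whose type matches after
# stripping the B-/I- prefix are merged, and the final group is always
# flushed (fix 2: the trailing entity is no longer dropped).

def group_entities_sketch(raw):
    groups, current = [], []

    def flush():
        if current:
            groups.append({
                "entity_group": current[0]["entity"],  # label of first member
                "score": sum(e["score"] for e in current) / len(current),
                "word": "".join(e["word"].replace("##", "") for e in current),
            })

    for ent in raw:
        if current and (
            ent["entity"].split("-")[-1] == current[-1]["entity"].split("-")[-1]
            and ent["index"] == current[-1]["index"] + 1
        ):
            current.append(ent)
        else:
            flush()
            current = [ent]
    flush()
    return groups

# Raw outputs taken verbatim from the ungrouped example in this PR.
raw = [
    {"entity": "I-PER", "index": 1, "score": 0.9968166351318359, "word": "En"},
    {"entity": "I-PER", "index": 2, "score": 0.9957635998725891, "word": "##zo"},
    {"entity": "I-ORG", "index": 7, "score": 0.9986497163772583, "word": "UN"},
]
grouped = group_entities_sketch(raw)
print(grouped)
```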

Please do let me know what you guys think! 😄

@LysandreJik (Member) left a comment

This looks very clean! What do you think @julien-c, @mfuntowicz? (Let's wait for v3.0.2 before merging)

@julien-c (Member) commented Jul 7, 2020

Yes, looks good. I would add some typings to (at least) group_entities and group_sub_entities, but we can do that in a subsequent PR.

@enzoampil (Author)

@LysandreJik @julien-c Thanks for the feedback. I've added typings for the group_entities and group_sub_entities functions 😄
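For reference, the added typings presumably look something like this (a sketch with hypothetical standalone signatures; the real methods live on the NER pipeline class in src/transformers/pipelines.py):

```python
from typing import List

# Sketch of the typed signatures (hypothetical; see pipelines.py for the
# actual methods, which are defined on the pipeline class):

def group_sub_entities(self, entities: List[dict]) -> dict:
    """Merge one run of sub-tokens into a single entity dict."""
    ...

def group_entities(self, entities: List[dict]) -> List[dict]:
    """Convert raw token-level predictions into grouped entities."""
    ...
```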
