Constituency js #916

DeNeutoy · 2018-02-23T18:30:20Z

Adds a Hierplane demo for the constituency parser.
Weaves a new bit of metadata through the model so that we don't get unknown words in the demo (this actually simplified things quite a bit).
I haven't added the model yet, because it's not good enough and I still need to figure out why. But merging this doesn't depend on that, so I figured it would be best to split it up here.

The demo looks like this:
(We've hit the maximum text-wrapping capacity of the demo on a Mac. We're going to need a "Select a demo model" page pretty soon. It could look like this, but for deep learning models.)

joelgrus · 2018-02-23T23:49:32Z

tests/data/dataset_readers/penn_tree_bank_reader_test.py

@@ -47,7 +47,8 @@ def test_read_from_file(self):
                                    "VP(TO to)(VP(VB be)(ADJP(JJ fair)(PP(TO to)(NP(JJ other)(NNS "
                                    "bidders))))))))))))))(. .)))")

-        assert fields["gold_tree"].metadata == gold_tree
+        assert fields["metadata"].metadata["gold_tree"] == gold_tree
+        assert fields["metadata"].metadata["tokens"] == tokens


This always feels clunky to me. I wonder if `MetadataField` should implement `__getitem__`? (Not in this PR of course, just in general.)
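For illustration, a minimal sketch of what that suggestion could look like. `MetadataFieldSketch` is a hypothetical stand-in written for this example, not the actual AllenNLP class, and the only assumption taken from the diff above is that the field stores its payload in a `.metadata` attribute.

```python
from typing import Any, Dict, List


class MetadataFieldSketch:
    """Hypothetical stand-in for MetadataField (not the AllenNLP class)."""

    def __init__(self, metadata: Dict[str, Any]) -> None:
        # The wrapped payload, mirroring the `.metadata` attribute used in the test.
        self.metadata = metadata

    def __getitem__(self, key: str) -> Any:
        # Delegate indexing to the wrapped dictionary.
        return self.metadata[key]


tokens: List[str] = ["the", "dog", "barked"]
gold_tree = "(S (NP (DT the) (NN dog)) (VP (VBD barked)))"
field = MetadataFieldSketch({"gold_tree": gold_tree, "tokens": tokens})

# With __getitem__, the doubled `.metadata[...]` access collapses to one lookup.
assert field["gold_tree"] == field.metadata["gold_tree"]
```

With something like this, the assertion in the diff could read `fields["metadata"]["gold_tree"] == gold_tree` instead of going through `.metadata` explicitly.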

We could do this for all of our fields, e.g. `for token in TextField`, etc. I'm a fan.

What does `for token in TextField` return, and when is it called?

For the MetadataField, it was originally a hack to get something working for BiDAF. There is definitely room to improve it.

https://github.com/allenai/allennlp/blob/master/tests/data/dataset_readers/penn_tree_bank_reader_test.py#L24

It would be equivalent to that list comprehension (or it would probably return Tokens). Maybe it's less useful because it's unclear what it would return, but it would make code like that much cleaner.
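A rough sketch of the iteration idea, under the assumption that the linked test line is a comprehension over the field's token list. `Token` and `TextFieldSketch` are hypothetical minimal classes defined here for the example; they are not the AllenNLP implementations, and yielding token objects rather than strings is just one of the two options mentioned above.

```python
from typing import Iterator, List, NamedTuple


class Token(NamedTuple):
    """Hypothetical minimal token that only carries its text."""
    text: str


class TextFieldSketch:
    """Hypothetical stand-in for TextField (not the AllenNLP class)."""

    def __init__(self, tokens: List[Token]) -> None:
        self.tokens = tokens

    def __iter__(self) -> Iterator[Token]:
        # Iterating the field yields its Token objects in order.
        return iter(self.tokens)

    def __len__(self) -> int:
        return len(self.tokens)


text_field = TextFieldSketch([Token("to"), Token("be"), Token("fair")])

# Without __iter__, a caller reaches into the field: [t.text for t in text_field.tokens]
# With __iter__, the field itself can be looped over:
words = [token.text for token in text_field]
```

The open question from the thread remains a design choice: returning `Token` objects keeps all token attributes available, while returning plain strings would be simpler but lossy.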

* 'working' constituency parsing demo
* wire the actual sentence through model, predictor and dataset reader
* 90% working hierplane vis
* tidy up, remove sentence lengths from return dict
* clean up some js, remove spans from the demo
* better description of the model
* add explicit hierplane tree in test

Mark Neumann added 6 commits February 22, 2018 15:42

'working' constituency parsing demo

1d035ac

wire the actual sentence through model, predictor and dataset reader

00a0f10

90% working hierplane vis

a64fb6a

tidy up, remove sentence lengths from return dict

46cbd8c

clean up some js, remove spans from the demo

fd2e566

better description of the model

3ea1c14

DeNeutoy requested review from joelgrus and matt-gardner February 23, 2018 18:46

add explicit hierplane tree in test

2468686

joelgrus approved these changes Feb 23, 2018

DeNeutoy merged commit 8b706e4 into allenai:master Feb 24, 2018

DeNeutoy deleted the constituency-js branch February 24, 2018 00:27
