What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

fishfree · 2024-05-06T22:20:59Z

For example, CoNLL-U file, I guess the correspondent elements are "text", but for TEI, I'm not even able to guess.

jan-niestadt · 2024-05-07T07:53:15Z

I think there might be some confusion here. The corpusConfig.contentViewable in the .blf.yaml file controls whether the CORPUSNAME/docs/PID/contents operation succeeds or fails. It has no relation to the input document format you're using, so it works the same for CoNLL-U and TEI.

As for what is indexed in an annotated field (usually only one, named contents), that is of course specified in the annotatedFields section of the config file. For example, in the file tei-p5.blf.yaml, what words get indexed for contents is determined by the documentPath and containerPath, so for that file it would be //TEI//text.

Does that answer your question?

fishfree · 2024-05-07T22:01:08Z

@jan-niestadt Thank you very much! the CORPUSNAME/docs/PID/contents operation gets extracted plain text or the formats such as CoNLL-U and TEI?

Checked: the formats such as CoNLL-U and TEI

jan-niestadt · 2024-05-08T07:18:35Z

Yes, BlackLab stores your input document and you can retrieve it at /contents. It can highlight the input document as well if you pass your query to that URL.

fishfree closed this as completed May 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

fishfree commented May 6, 2024

jan-niestadt commented May 7, 2024

fishfree commented May 7, 2024 •

edited

Loading

jan-niestadt commented May 8, 2024

What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

Comments

fishfree commented May 6, 2024

jan-niestadt commented May 7, 2024

fishfree commented May 7, 2024 • edited Loading

jan-niestadt commented May 8, 2024

fishfree commented May 7, 2024 •

edited

Loading