What are the elements correspondent to the contentViewable permission in every supported corpus file format? #519

Closed
fishfree opened this issue May 6, 2024 · 3 comments

Comments

@fishfree

fishfree commented May 6, 2024

For example, for a CoNLL-U file, I guess the corresponding element is "text", but for TEI I can't even guess.

@jan-niestadt
Member

I think there might be some confusion here. The corpusConfig.contentViewable setting in the .blf.yaml file controls whether the CORPUSNAME/docs/PID/contents operation succeeds or fails. It has no relation to the input document format you're using, so it works the same for CoNLL-U and TEI.

As for what is indexed in an annotated field (usually only one, named contents), that is of course specified in the annotatedFields section of the config file. For example, in the file tei-p5.blf.yaml, which words get indexed for contents is determined by the documentPath and containerPath, so for that file it would be //TEI//text.
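
To make the //TEI//text path concrete, here is a rough Python sketch of which part of a document such a path selects. It is only an illustration, not how BlackLab itself processes input: the sample document is made up and namespace-free (real TEI files normally declare a namespace), and `text_containers` is a hypothetical helper name. How words are tokenized inside the selected region depends on other settings in the config file that are not covered here.

```python
import xml.etree.ElementTree as ET

# Made-up, namespace-free sample; real TEI documents usually declare a namespace.
TEI_SAMPLE = """<TEI>
  <teiHeader><fileDesc><titleStmt><title>Header title</title></titleStmt></fileDesc></teiHeader>
  <text><body><p>Words inside the text element.</p></body></text>
</TEI>"""


def text_containers(tei_xml):
    """Return the whitespace-normalized text of every <text> element below <TEI>.

    Hypothetical helper: it mimics what the path //TEI//text would select
    when the document root is <TEI>.
    """
    root = ET.fromstring(tei_xml)          # root element is <TEI>
    containers = root.findall(".//text")   # all <text> descendants of the root
    return [" ".join("".join(c.itertext()).split()) for c in containers]


selected = text_containers(TEI_SAMPLE)
print(selected)
```

Note that the header title does not show up in the result, because teiHeader lies outside the text element.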

Does that answer your question?

@fishfree
Author

fishfree commented May 7, 2024

@jan-niestadt Thank you very much! Does the CORPUSNAME/docs/PID/contents operation return extracted plain text, or the document in its original format, such as CoNLL-U or TEI?

Checked: it returns the document in its original format, such as CoNLL-U or TEI.

fishfree closed this as completed May 8, 2024
@jan-niestadt
Member

Yes, BlackLab stores your input document and you can retrieve it at /contents. It can highlight the input document as well if you pass your query to that URL.
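
As a minimal sketch of what those requests could look like, the Python below only builds the URLs. The base URL, corpus name, and document PID are placeholders, `contents_url` is a hypothetical helper, and the query parameter name `patt` is an assumption based on how BlackLab Server usually accepts patterns, so check it against the documentation for your version.

```python
from urllib.parse import quote, urlencode


def contents_url(base_url, corpus, pid, patt=None):
    """Build a CORPUSNAME/docs/PID/contents URL (hypothetical helper).

    If patt is given, it is appended as a query parameter; the name
    'patt' is an assumption, not confirmed in this thread.
    """
    url = (
        f"{base_url.rstrip('/')}/{quote(corpus, safe='')}"
        f"/docs/{quote(pid, safe='')}/contents"
    )
    if patt is not None:
        url += "?" + urlencode({"patt": patt})
    return url


# Placeholder server location, corpus name and PID.
BASE = "http://localhost:8080/blacklab-server"
plain = contents_url(BASE, "mycorpus", "doc001")
highlighted = contents_url(BASE, "mycorpus", "doc001", patt='"tower"')
print(plain)
print(highlighted)
```

Either URL can then be fetched with any HTTP client. If contentViewable is not enabled for the corpus, expect the request to fail rather than return the document.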
