Skip to content

Commit

Permalink
docs: Add data discovery tip to import page (#5816)
Browse files Browse the repository at this point in the history
Add a tip to the page on importing data pointing users to the data
discovery functionality.

Affects:
- [X] Community docs
- [X] Enterprise docs

Co-authored-by: caitlinwheeless <caitlin@humansignal.com>
  • Loading branch information
caitlinwheeless and caitlinwheeless committed May 3, 2024
1 parent ecea57a commit 26ec9f5
Showing 1 changed file with 19 additions and 0 deletions.
19 changes: 19 additions & 0 deletions docs/source/guide/tasks.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,24 @@ Get data into Label Studio by importing files, referencing URLs, or syncing with
- If your data is stored locally, [import it into Label Studio](#Import-data-from-a-local-directory).
- If your data contains predictions or pre-annotations, see [Import pre-annotated data into Label Studio](predictions.html).

<div class="enterprise-only">

!!! info Tip
If your data is stored in Google Cloud, AWS, or Azure, you can [import your unstructured data as a dataset in Label Studio Enterprise](dataset_create).

From here, you can use semantic search and similarity search to curate data for labeling, which can then be added to different projects as tasks. For more information, see [Data Discovery overview](dataset_overview).

</div>

<div class="opensource-only">

!!! error Enterprise
If your data is stored in Google Cloud, AWS, or Azure, you can [import your unstructured data as a dataset in Label Studio Enterprise](https://docs.humansignal.com/guide/dataset_create).

From here, you can use semantic search and similarity search to curate data for labeling, which can then be added to different projects as tasks. For more information, see [Data Discovery overview](https://docs.humansignal.com/guide/dataset_overview).

</div>

## General guidelines for importing data

* It’s best to keep about 100k tasks / 100k annotations per project for optimal performance.
Expand All @@ -30,6 +48,7 @@ Get data into Label Studio by importing files, referencing URLs, or syncing with

We strongly recommend that you configure [source storage](storage) instead.


## Types of data you can import into Label Studio

You can import many types of data, including text, timeseries, audio, and image data. The file types supported depend on the type of data.
Expand Down

0 comments on commit 26ec9f5

Please sign in to comment.