Select from Hugging Face Dataset Hub

If you do not have a dataset readily available, we provide a dataset hub that offers preloaded datasets. The dataset hub is integrated with Hugging Face and other popular dataset sources. You can select a dataset from the hub to get started with Anote.

The Hugging Face dataset hub (https://huggingface.co/datasets) hosts over 50,000 datasets that we provide access to. Below are a few examples of datasets you can load from the hub on Anote:
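
For readers who want to inspect a hub dataset outside of Anote, the following is a minimal sketch that uses the open-source Hugging Face `datasets` library directly. It does not show Anote's own interface. The helper names `load_hub_split` and `preview` are hypothetical, and the sketch assumes the short identifiers used on this page (such as "ag_news") still resolve on the hub.

```python
from itertools import islice
from typing import Any, Iterable, List


def load_hub_split(name: str, split: str = "train") -> Any:
    """Load one split of a hub dataset by its identifier, e.g. "ag_news".

    Hypothetical helper: it wraps datasets.load_dataset and needs
    network access to huggingface.co the first time it is called.
    """
    # Imported lazily so this module can be imported without the library.
    from datasets import load_dataset  # pip install datasets

    return load_dataset(name, split=split)


def preview(rows: Iterable[Any], n: int = 3) -> List[Any]:
    """Return the first n rows of any iterable of rows as a list."""
    return list(islice(rows, n))


# Example usage (commented out because it downloads data):
# train = load_hub_split("ag_news", "train")
# for row in preview(train):
#     print(row)
```

The same call pattern should apply to the other identifiers listed here, such as "go_emotions" and "rotten_tomatoes", assuming those names are still valid on the hub.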

ag_news

ag_news is a news categorization dataset often used for text classification tasks. It consists of titles and descriptions from AG's corpus of news articles, divided into four categories: World, Sports, Business, and Science/Technology.
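
As an illustration of the four-category structure, the sketch below maps integer labels to category names and counts examples per category. It assumes the label order used by the Hugging Face copy of the dataset (0 = World, 1 = Sports, 2 = Business, 3 = Sci/Tech) and rows shaped like `{"text": ..., "label": ...}`. The sample rows are invented for the example and are not taken from the dataset.

```python
from collections import Counter
from typing import Dict, Iterable

# Assumed label order, matching the Hugging Face version of ag_news.
AG_NEWS_LABELS = ["World", "Sports", "Business", "Sci/Tech"]


def label_name(label_id: int) -> str:
    """Translate an integer class label into its category name."""
    if not 0 <= label_id < len(AG_NEWS_LABELS):
        raise ValueError(f"unknown ag_news label id: {label_id}")
    return AG_NEWS_LABELS[label_id]


def count_by_category(rows: Iterable[Dict]) -> Counter:
    """Count how many rows fall into each named category."""
    return Counter(label_name(row["label"]) for row in rows)


# Invented sample rows, for illustration only.
sample_rows = [
    {"text": "Markets rally after rate decision", "label": 2},
    {"text": "Home team wins the final", "label": 1},
    {"text": "Quarterly earnings beat forecasts", "label": 2},
]
counts = count_by_category(sample_rows)
```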

go_emotions

go_emotions is a dataset centered around the granular classification of emotions in text. It's designed to identify a broad range of emotional states from textual data. Multiple emotions can be associated with each sentence, providing a rich representation of human emotional response.
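
Because each sentence can carry several emotions, a common way to prepare such data for a classifier is a multi-hot vector with one slot per emotion. The sketch below shows that encoding. The function name `to_multi_hot` is hypothetical, and the label count is passed in as a parameter because the exact number of emotion classes depends on the dataset configuration you load.

```python
from typing import List, Sequence


def to_multi_hot(label_ids: Sequence[int], num_labels: int) -> List[int]:
    """Encode a list of label ids as a 0/1 vector of length num_labels."""
    vector = [0] * num_labels
    for label_id in label_ids:
        if not 0 <= label_id < num_labels:
            raise ValueError(f"label id {label_id} is out of range")
        vector[label_id] = 1  # Duplicate ids simply set the same slot again.
    return vector


# A sentence tagged with emotions 0 and 2, out of 4 possible labels.
encoded = to_multi_hot([0, 2], num_labels=4)
```

A single-label dataset such as ag_news would always produce exactly one 1 per vector, while a multi-label dataset can produce zero, one, or several.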

rotten_tomatoes

rotten_tomatoes is a dataset of movie reviews sourced from the Rotten Tomatoes website. Each review is a short piece of text labeled as positive or negative, making it useful for tasks like sentiment analysis.
