Select from Hugging Face Dataset Hub
If you do not have a dataset relatively available, we provide a dataset hub that offers preloaded datasets. The dataset hub is integrated with platforms like Hugging Face and other popular datasets. You can select a dataset from the hub to get started using Anote.
The hugging face dataset hub (https://huggingface.co/datasets) has over 50,000 dataset that we provide access to, below are a few examples on loading datasets from the hub on Anote:
ag_news
ag_news is a news categorization dataset, often used for text classification tasks. It usually consists of titles and descriptions from AG's corpus of news articles, segmented into four categories such as World, Sports, Business, and Science/Technology.
go_emotions
go_emotions is a dataset centered around the granular classification of emotions in text. It's designed to identify a broad range of emotional states from textual data. Multiple emotions can be associated with each sentence, providing a rich representation of human emotional response.
rotten_tomatoes
rotten_tomatoes is a dataset that is often related to movie reviews sourced from the Rotten Tomatoes website. This dataset can contain reviews, ratings, and sentiments, making it useful for tasks like sentiment analysis and recommendation systems.