SNLI
The Stanford Natural Language Inference Corpus
SNLI is a large, annotated corpus for learning natural language inference, providing a benchmark for evaluating text representation systems.

The Stanford Natural Language Inference (SNLI) Corpus is a collection of 570k human-written English sentence pairs, manually labeled for balanced classification with the labels entailment, contradiction, and neutral. It serves as a benchmark for evaluating representational systems for text, including those induced by representation-learning methods, and as a resource for developing NLP models. The corpus targets Natural Language Inference (NLI), also known as Recognizing Textual Entailment (RTE): the task of determining the inference relation between two texts. SNLI is distributed in both JSON lines and tab-separated value formats. Researchers and developers in natural language processing and machine learning use it to train and evaluate models for text understanding and semantic reasoning, and its content draws on the Flickr 30k and VisualGenome corpora.
SNLI is a focused resource for entailment classification, relationship identification, and performance benchmarking.
SNLI contains 570k human-written sentence pairs, providing a substantial amount of data for training robust NLI models.
The dataset is balanced with respect to the three classes: entailment, contradiction, and neutral, ensuring equal representation for each category.
In the validated development and test sets, each sentence pair has judgments from multiple annotators, providing a consensus gold label that improves data quality.
The corpus includes content from the Flickr 30k corpus and VisualGenome, providing a variety of real-world sentence structures and topics.
SNLI is available in both JSON lines and tab-separated value formats, offering flexibility for different data processing pipelines.
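The consensus labeling described above can be sketched as a simple majority vote. This is an illustrative reimplementation, not the official tooling: in the distributed files the winning label is already precomputed in the gold_label field, with "-" marking pairs where annotators reached no majority.

```python
from collections import Counter

def consensus_label(annotator_labels):
    """Return the strict-majority label over annotator judgments,
    or '-' when no label wins a majority (as SNLI marks such pairs)."""
    counts = Counter(annotator_labels)
    label, votes = counts.most_common(1)[0]
    return label if votes > len(annotator_labels) / 2 else "-"
```

Pairs that receive a "-" gold label are conventionally excluded from training and evaluation.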
Visit the SNLI project page at https://nlp.stanford.edu/projects/snli/.
Download the SNLI 1.0 corpus in zip format.
Extract the downloaded zip file to access the dataset files.
Read the 'readme' file for details on the dataset structure and usage.
Choose either the JSON lines or tab-separated value format for accessing the data.
Load the dataset into your preferred NLP framework (e.g., TensorFlow, PyTorch).
Begin preprocessing the text data for training or evaluation.
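As a minimal sketch of the loading and preprocessing steps above, the JSON-lines format can be parsed with the Python standard library alone. The field names sentence1, sentence2, and gold_label come from the SNLI distribution; the load_snli helper and the example file path are illustrative, not part of any official API.

```python
import json

def load_snli(path):
    """Read SNLI examples from a JSON-lines file, skipping pairs
    whose gold_label is '-' (no annotator consensus)."""
    examples = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            record = json.loads(line)
            if record["gold_label"] == "-":
                continue  # excluded by convention
            examples.append(
                (record["sentence1"], record["sentence2"], record["gold_label"])
            )
    return examples

# Hypothetical path inside the extracted archive:
# train = load_snli("snli_1.0/snli_1.0_train.jsonl")
```

The resulting (premise, hypothesis, label) tuples can then be tokenized and fed to whichever framework you chose in the previous step.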
"SNLI is a widely used and valuable resource for training and evaluating NLI models, cited in numerous research publications. The dataset's scale and balanced design contribute to its effectiveness in improving model performance."