Yelp Open Dataset

An all-purpose dataset for learning

The Yelp dataset is a subset of our businesses, reviews, and user data for use in connection with academic research. Available as JSON files, use it to teach students about databases, to learn NLP, or for sample production data while you learn how to make mobile apps.

The Dataset


6,990,280 reviews

150,346 businesses

200,100 pictures

11 metropolitan areas
  • 908,915 tips by 1,987,897 users
  • Over 1.2 million business attributes like hours, parking, availability, and ambience
  • Aggregated check-ins over time for each of the 131,930 businesses

Get Started

Visit the documentation for information on the structure of the dataset and how to get started.