Visual cognition is a key capability of human being, but a big challenge in artificial intelligence. We provide thousands of labeled video sequences and clips to help researchers train their models to "understand" what these videos represent.
Scene parsing is a core capability for autonomous driving technologies. We have collected and annotated a large amount of outdoor scenes captured by vehicle mounted sensors. The whole dataset will evolve to include RGB videos with per pixel annotation and high-accuracy depth, stereoscopic video, and panoramic images.
Machine Reading Comprehension
Machine Reading Comprehension (MRC) is one of the core abilities of artificial intelligence. We release DuReader 2.0, a large-scale real-world Chinese dataset for MRC to promote the research. DuReader 2.0 contains more than 300K questions, 1.4M evident documents and 660K human generated answers. Related competition.Related competition
Open-Domain Information Extraction (OIE) is a task of extracting important information from open-domain sentences. OIE are proven valuable in many artificial intelligence tasks such as text summarization, text comprehension, knowledge-based question answering systems, and more. We release SAOKE dataset, a human annotated dataset containing more than 40 thousand of Chinese sentences and the corresponding facts in SAOKE form.
Schema based Knowledge Extraction (SKE) Dataset offers a large number of real Chinese sentences with manually annotated and SPO triples. It provides a challenging benchmark for evaluating knowledge extraction algorithms bounded by a pre-defined schema.
Traffic Speed Prediction
We provide a large-scale real-world traffic speed prediction dataset - Q-Traffic dataset, which consists of 114 million crowd user queries, geographical attributes and traffic speed of 15,073 road segments.
Entity Recognition and Linking (ERL) is a fundamental task in the research and application of knowledge graph. It identifies entities in a given text and link them to the corresponding entries in a knowledge base. It is the building block for many intelligent systems such as search engine, question and answering system, recommendation system, dialog system. We are releasing the BERL dataset, a large-scale corpus of Chinese short-texts for entity recognition and linking tasks. BERL contains 100K annotated short text, and corresponding mention and links to entities in Baidu Knowledge Base.
We introduce a large-scale dataset of dog species for fine-grained classification tasks, which consists of 300,000 manually-annotated images of 362 dog categories. Being an important animal that is indispensable in our daily life, dog has a natural body configuration for understanding visual attentions. This dataset is hence useful to the developments of our FGVC community.
Fine-grained 3D Pose
In this Fine-Grained 3D Pose Dataset, we augment three existing fine-grained object datasets, i.e., StanfordCars, FGVC-Aircraft and CompCars, with 3D annotations. For each image in these datasets, we annotate the following two things: its corresponding 3D model and 3D pose.