GTAVDataCollection

GTAVDataCollection is a mod to extract synthetic data from Grand Theft Auto V. The data includes photo-realistic computer images and annotations that can be used for the training of machine learning algorithms.

demo image

English 简体中文

🛠️ Requirements

🚀 Quick Start

  1. Download and install ScriptHookV、ScriptHookVDotNet and Scripted Camera Tool.
  2. Download pre-built binaries and copy them to Grand Theft Auto V/scripts/.
  3. Start the game.
  4. Press T to set the camera.
  5. Press Y to start or stop extracting data. The data should be created in Grand Theft Auto V/data/.

💿 Dataset

Download

A large-scale remote sensing image dataset for vehicle object detection called GTA5-Vehicle has been built based on GTAVDataCollection. Please also note that the data is for research and educational use only.

Baidu Netdisk

Object Category

The dataset includes 15 vehicle object classes: Compacts, Sedans, SUVs, Coupes, Muscle, SportsClassics, Sports, Super, OffRoad, Industrial, Utility, Vans, Service, Emergency, and Commercial. Here are some example images of these objects:

Compacts Compacts_1 Compacts_1 Compacts_1 Compacts_1 Compacts_1
Sedans Sedans_1 Sedans_1 Sedans_1 Sedans_1 Sedans_1
SUVs SUVs_1 SUVs_1 SUVs_1 SUVs_1 SUVs_1
Coupes Coupes_1 Coupes_1 Coupes_1 Coupes_1 Coupes_1
Muscle Muscle_1 Muscle_1 Muscle_1 Muscle_1 Muscle_1
SportsClassics SportsClassics_1 SportsClassics_1 SportsClassics_1 SportsClassics_1 SportsClassics_1
Sports Sports_1 Sports_1 Sports_1 Sports_1 Sports_1
Super Super_1 Super_1 Super_1 Super_1 Super_1
OffRoad OffRoad_1 OffRoad_1 OffRoad_1 OffRoad_1 OffRoad_1
Industrial Industrial_1 Industrial_1 Industrial_1 Industrial_1 Industrial_1
Utility Utility_1 Utility_1 Utility_1 Utility_1 Utility_1
Vans Vans_1 Vans_1 Vans_1 Vans_1 Vans_1
Service Service_1 Service_1 Service_1 Service_1 Service_1
Emergency Emergency_1 Emergency_1 Emergency_1 Emergency_1 Emergency_1
Commercial Commercial_1 Commercial_1 Commercial_1 Commercial_1 Commercial_1

Label Format

Each image has a corresponding label file. Here’s an example of a label structure: The first line contains the image’s dimensions. The second line provides camera pose information, including position and orientation. From the third line onwards, each line represents a target with its index, bounding box coordinates, vehicle category, size, and quality

2048,1152
X:-420.2515 Y:-2069.219 Z:144.7112,X:-89.97202 Y:-70.18397 Z:0
0,1120,255,1149,331,Sedans,small-vehicle,High
1,1100,732,1126,796,Sedans,small-vehicle,High
2,3,701,92,801,Commercial,large-vehicle,High
...

Citation

If you make use of the GTAVDataCollection or GTA5-Vehicle dataset, please cite our following paper


💻 项目地址 | 📬 联系我