How to Access and Use the ScanNet RGB-D Video Dataset

May 24, 2024 | Data Science

Welcome to the world of ScanNet, a treasure trove of RGB-D video data that can significantly aid in your research or projects dealing with 3D scene understanding. With over 2.5 million views captured from more than 1500 scans, this dataset comes packed with valuable annotations including 3D camera poses, surface reconstructions, and instance-level semantic segmentations. If you’re excited to dive in, we’ve got you covered!

Downloading the ScanNet Data

To begin your journey with ScanNet, you’ll need to download the data. Here’s how:

  • Fill out an agreement to the ScanNet Terms of Use using your institutional email.
  • Send the completed agreement to scannet@googlegroups.com.
  • If you do not receive a response within a week, please check if your email is bouncing before sending repeated requests.
  • Avoid replying to the noreply email – your email won’t be seen!

For updates on the dataset, check the changelog for the latest information about data releases.

Understanding the Data Organization

The data in ScanNet is organized by RGB-D sequences. Imagine each sequence as a unique book in a library where each path leads to a different story waiting to unfold. Here’s the directory structure that you can expect:

shellscanId
-- scanId.sens    # RGB-D sensor stream with color frames, depth frames, camera poses
-- scanId_vh_clean.ply    # High quality reconstructed mesh
-- scanId_vh_clean_2.ply    # Cleaned mesh for semantic annotations
-- scanId.aggregation.json   # Aggregated semantic annotations
-- scanId_2d-label.zip   # 2D projections of aggregated labels
-- scanId_2d-instance.zip # 2D projections of aggregated instance counts

Each file has its own crucial role in creating the complete picture, like the various ingredients in a recipe. The raw data captures the essence of the scanning job, while the annotations are like the careful seasoning that brings flavor to the final dish.

Data Formats Explained

ScanNet employs several file types to encapsulate its data:

  • Reconstructed surface mesh file (*.ply): Binary format with an upright +Z axis.
  • RGB-D sensor stream (*.sens): Compressed binary format containing color and depth data per frame.
  • Semantic annotation files (*.aggregation.json): Contains identifiers and groups that tie annotations to their respective scenes.
  • 2D annotation projections (*_2d-label.zip and *_2d-instance.zip): These files represent 3D annotations as they appear in the RGB-D frames.

Utilizing ScanNet Tools for Analysis

ScanNet offers various tools for data manipulation and analysis:

  • ScanNet C++ Toolkit: This toolkit allows you to work with the .sens data format.
  • Camera Parameter Estimation Code: Used for computing sensor calibration files.
  • Mesh Segmentation Code: Prepares meshes for semantic annotation.

By leveraging these tools, you can enhance your analysis and ensure the data works to your full advantage.

Troubleshooting Common Issues

If you encounter problems while accessing or using the data, here are some troubleshooting tips:

  • Ensure your institutional email is correctly entered when filling out the agreement.
  • Check your spam folder to see if responses went unnoticed.
  • Validate that you’re sending emails to the correct address without typos.
  • If your request is not acknowledged after a week, double-check for any possible email bounce-back issues.

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Conclusion

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.

Now that you’ve learned how to access, organize, and utilize the ScanNet dataset, it’s time to get started on your project! Happy exploring!

Stay Informed with the Newest F(x) Insights and Blogs

Tech News and Blog Highlights, Straight to Your Inbox