How to Get Started with Sneller: SQL for JSON at Scale

Aug 20, 2024 | Programming

Have you ever found yourself buried under heaps of unstructured JSON data and wished for a fast, simple solution to analyze it? Look no further! Sneller, a high-performance SQL engine designed specifically for analyzing petabyte-scale unstructured logs and other event data, is here to help. Whether you want to harness the power of cloud computing or run queries locally, this guide will walk you through the essentials of getting started with Sneller.

Becoming a Test Partner

If you’re interested in becoming a test partner for Sneller’s serverless cloud offering, reach out to frank@sneller.io. Test partners get early access to new features.

Why Choose Sneller?

  • Cloud Object Storage: Sneller uses cloud object storage as its only backing store, making it an efficient option for large-scale data.
  • Unparalleled Performance: Sneller’s SQL VM is implemented in AVX-512 assembly, and medium-sized compute clusters can deliver throughput in excess of terabytes per second.
  • Schemaless SQL: Forget the hassles of ETL processes; Sneller is completely schemaless, allowing direct ingestion of heterogeneous JSON data.
  • Hybrid Data Layouts: Experience lightweight ingest, low storage footprint, and super-fast scanning speeds with our innovative approach combining columnar and row-oriented layouts.
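
To make the schemaless point concrete, here is a minimal sketch that writes three differently shaped events to a file and, if sdb happens to be installed, packs and queries them with the same sdb pack and sdb query commands shown later in this guide. The file names and sample records are made up for illustration, and packing an uncompressed .json file is an assumption rather than something this guide confirms.

```shell
#!/bin/sh
# Three events with different shapes; no schema is declared anywhere.
# events.json and its contents are hypothetical sample data.
cat > events.json <<'EOF'
{"type": "login", "user": "alice", "ts": "2024-01-01T00:00:00Z"}
{"type": "error", "code": 500, "detail": {"path": "/api", "retries": 3}}
{"type": "login", "user": "bob", "tags": ["mobile", "beta"]}
EOF

# Pack and query only when sdb is on the PATH (assumption: sdb pack
# accepts a plain .json input as well as .json.gz).
if command -v sdb >/dev/null 2>&1; then
    sdb pack -o events.zion events.json
    sdb query -fmt=json "select count(*), type from read_file('events.zion') group by type"
fi
```

Note that nothing in the script declares field names or types before the data is packed.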

Accessing Sneller Cloud

Sneller Cloud provides a hosted version of the Sneller SQL engine running directly on data stored entirely in your S3 buckets. With competitive pricing at just $150 per petabyte of data scanned, it is an excellent choice for those handling significant amounts of data.
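
For a rough feel of what that price means in practice, the sketch below turns terabytes scanned into dollars. It assumes the $150-per-petabyte figure quoted above and decimal units (1 PB = 1000 TB); scan_cost_usd is a hypothetical helper name, not part of any Sneller tool.

```shell
#!/bin/sh
# scan_cost_usd TB: estimated cost in USD of scanning TB terabytes,
# assuming $150 per petabyte and 1 PB = 1000 TB.
scan_cost_usd() {
    awk -v tb="$1" 'BEGIN { printf "%.2f\n", tb / 1000 * 150 }'
}

scan_cost_usd 1      # a single 1 TB scan
scan_cost_usd 250    # 250 TB scanned in total
```

Actual billing may round or meter differently, so treat the output as an estimate only.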

Exploring the Browser Demo

Want to try Sneller without any setup? You can run queries for free in the Sneller playground. It includes a public table of about 1 billion rows sourced from the GitHub archive dataset.

Running Queries Locally

If you have Go installed and your machine supports AVX-512, you can run the Sneller query engine locally. Here’s how you can get started:

$ grep -q avx512 /proc/cpuinfo && echo "yes, I have AVX512"
yes, I have AVX512

# install the sdb tool (make sure $GOBIN is in your $PATH)
$ go install github.com/SnellerInc/sneller/cmd/sdb@latest

# pack a JSON object into a table that can be queried
$ wget https://data.gharchive.org/2015-01-01-15.json.gz
$ sdb pack -o github.zion 2015-01-01-15.json.gz

# run a query, using JSON as the output format
$ sdb query -v -fmt=json "select count(*), type from read_file('github.zion') group by type"

Understanding the Code

Think of the above code as a recipe to whip up a delicious data analysis dish. First, you verify you have the main ingredient (AVX-512 support). Next, you gather your tools (installing the sdb tool), much like a chef preparing their utensils. You then pack your selected data (the JSON object) into a container (the table), and finally, you execute your query, akin to serving the finished dish and enjoying the results.

Performance Insights

Sneller typically achieves end-to-end scanning rates exceeding 1 GB/s per core on high-core-count machines. The core SQL engine can usually saturate the machine's memory bandwidth, so adding CPU cores raises scanning throughput until memory bandwidth becomes the limit.
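
As a back-of-the-envelope illustration, the sketch below estimates how long a full scan might take. It assumes, purely for illustration, about 1 GB/s per core and perfectly linear scaling across cores; real numbers depend on the query, the data, and memory bandwidth. scan_seconds is a hypothetical helper name.

```shell
#!/bin/sh
# scan_seconds SIZE_GB CORES [GBPS_PER_CORE]
# Idealized scan time in seconds: size / (cores * per-core rate).
# The per-core rate defaults to 1 GB/s, an assumed illustrative figure.
scan_seconds() {
    awk -v gb="$1" -v cores="$2" -v rate="${3:-1}" \
        'BEGIN { printf "%.1f\n", gb / (cores * rate) }'
}

scan_seconds 1024 32   # 1024 GB on 32 cores
scan_seconds 1024 64   # the same data on 64 cores
```

Under these assumptions, doubling the core count halves the estimated scan time.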

Troubleshooting Tips

  • AVX-512 is a CPU feature, so it cannot be installed after the fact. If grep finds no avx512 flags in /proc/cpuinfo, run the local engine on a different machine or use the browser playground instead.
  • For connectivity issues with Sneller Cloud, check your internet connection and ensure that all configured settings are correct.
  • If you encounter performance issues, consider optimizing your query or upgrading your instance if running on a cloud provider.
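
The first tip can be checked with a small diagnostic script. The sketch below lists the AVX-512 flags the Linux kernel reports and checks whether sdb is reachable on your PATH. The helper name avx512_flags is hypothetical, and the script assumes a Linux system where /proc/cpuinfo exists.

```shell
#!/bin/sh
# avx512_flags [FILE]: print the distinct AVX-512 feature flags found in a
# cpuinfo-style file (defaults to /proc/cpuinfo, which exists on Linux only).
avx512_flags() {
    grep -o 'avx512[a-z0-9_]*' "${1:-/proc/cpuinfo}" 2>/dev/null | sort -u
}

flags=$(avx512_flags)
if [ -n "$flags" ]; then
    echo "AVX-512 flags reported by the kernel:"
    echo "$flags"
else
    echo "no AVX-512 flags found; the local engine needs AVX-512"
fi

# go install places binaries in $GOBIN, or $GOPATH/bin when GOBIN is unset.
if command -v sdb >/dev/null 2>&1; then
    echo "sdb found at $(command -v sdb)"
elif command -v go >/dev/null 2>&1; then
    echo "sdb not on PATH; try adding $(go env GOPATH)/bin to PATH"
else
    echo "neither sdb nor go is on PATH"
fi
```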
