Blog | Tenzir

Parquet & Feather: Enabling Open Investigations

October 7, 2022 · 6 min read

Founder & CEO

Data Engineer

Apache Parquet is the common denominator for structured data at rest. The data science ecosystem has long appreciated this. But infosec? Why should you care about Parquet when building a threat detection and investigation platform? In this blog post series we share our opinionated view on this question. In the next three blog posts, we

describe how VAST uses Parquet and its little brother Feather
benchmark the two formats against each other for typical workloads
share our experience with all the engineering gotchas we encountered along the way

A Git Retrospective

September 15, 2022 · 5 min read

Matthias Vallentin

Founder & CEO

The VAST project is roughly a decade old. But what happened over the last 10 years? This blog post looks back over time through the lens of the git merge commits.

Why merge commits? Because they represent a unit of completed contribution. Feature work takes place in dedicated branches, with the merge to the main branch sealing the deal. Some feature branches have just one commit, whereas others dozens. The distribution is not uniform. As of 6f9c84198 on Sep 2, 2022, there are a total of 13,066 commits, with 2,334 being merges (17.9%). We’ll take a deeper look at the merge commits.

Public Roadmap and Open RFCs

September 7, 2022 · 4 min read

Matthias Vallentin

Founder & CEO

We are happy to announce that we have published our engineering roadmap along with an RFC process to actively participate in shaping upcoming topics. This blog post explains why and how we did it.

VAST v2.3

September 1, 2022 · 4 min read

Dominik Lohmann

Engineering Manager

VAST v2.3 is now available, which introduces an automatic data defragmentation capability.

Richer Typing in Sigma

August 12, 2022 · 5 min read

Matthias Vallentin

Founder & CEO

VAST's Sigma frontend now supports more modifiers. In the Sigma language, modifiers transform predicates in various ways, e.g., to apply a function over a value or to change the operator of a predicate. Modifiers are the customization point to enhance expressiveness of query operations.

The new pySigma effort, which will eventually replace the now-considered-legacy sigma project, comes with new modifiers as well. Most notably, lt, lte, gt, gte provide comparisons over value domains with a total ordering, e.g., numbers: x >= 42. In addition, the cidr modifier interprets a value as subnet, e.g., 10.0.0.0/8. Richer typing!

VAST v2.2

August 5, 2022 · 3 min read

Benno Evers

Principal Engineer

We released VAST v2.2 🙌! Transforms now have a new name: pipelines. The summarize operator also underwent a facelift, making aggregation functions pluggable and allowing for assigning names to output fields.

VAST v2.1

July 7, 2022 · 4 min read

Dominik Lohmann

Engineering Manager

VAST v2.1 is out! This release comes with a particular focus on performance and reducing the size of VAST databases. It brings a new utility for optimizing databases in production, allowing existing deployments to take full advantage of the improvements after upgrading.

Apache Arrow as Platform for Security Data Engineering

June 17, 2022 · 6 min read

Matthias Vallentin

Founder & CEO

VAST bets on Apache Arrow as the open interface to structured data. By "bet," we mean that VAST does not work without Arrow. And we are not alone. Influx's IOx, DataDog's Husky, Anyscale's Ray, TensorBase, and others committed themselves to making Arrow a corner stone of their system architecture. For us, Arrow was not always a required dependency. We shifted to a tighter integration over the years as the Arrow ecosystem matured. In this blog post we explain our journey of becoming an Arrow-native engine.

VAST v2.0

May 16, 2022 · 7 min read

Dominik Lohmann

Engineering Manager

Dear community, we are excited to announce VAST v2.0, bringing faster execution of bulk-submitted queries, improved tunability of index structures, and new configurability through environment variables.

VAST v1.1.2

March 29, 2022 · One min read

Benno Evers

Principal Engineer

Dear community, we are happy to announce the release of VAST v1.1.2, the latest release on the VAST v1.1 series. This release contains a fix for a race condition that could lead to VAST eventually becoming unresponsive to queries in large deployments.

VAST v1.1.1

March 25, 2022 · One min read

Dominik Lohmann

Engineering Manager

Dear community, we are excited to announce VAST v1.1.1.

This release contains some important bug fixes on top of everything included in the VAST v1.1 release.

VAST v1.1

March 3, 2022 · 6 min read

Dominik Lohmann

Engineering Manager

Dear community, we are excited to announce VAST v1.1, which ships with exciting new features: query language plugins to exchange the query expression frontend, and compaction as a mechanism for expressing fine-grained data retention policies and gradually aging out data instead of simply deleting it.

VAST v1.0

January 27, 2022 · 4 min read

Dominik Lohmann

Engineering Manager

We are happy to announce VAST v1.0!

This release brings a new approach to software versioning for Tenzir. We laid out the semantics in detail in a new VERSIONING document.