Jul 15, 2021
Time series or event data? Get less confused
What is time series data? How can you be sure what kind of data you have? Find out how to tell the difference between time series and event data.
Time series data can be defined as a collection of data points or measurements taken at different times (as opposed to data about many objects concerning a single point in time). So structurally, time series data shares many characteristics with event streams.
In this article, we will go through some tips on telling the two apart.
Does it have a timestamp?
Time series data always has a timestamp; event data usually does as well. What makes this confusing is that very many other data models also include timestamps, so we'll need to investigate further.
Is it ever updated?
Neither time series data nor event streams are ever updated. After the data has been recorded, the records are immutable. Time series data is data collected from a single point in time. Events (as the name implies) refer to a single event. If something else happens later, that's a new data point in both cases.
How many fields does it have?
Time series data typically only has one measurement in each data point. It may also include many "labels" indicating what metric it is, which server and CPU it relates to, and so on.
Events usually have more fields, to provide the detail on the event that occurred. If a user logged in, we'll get the user and perhaps the referrer information, not just a numeric measurement.
Would you graph this value over time?
Another feature of time series data is that since it's usually point-in-time measurement, repeated an unimaginably large number of times both over time and usually across multiple measured items, it's hard to work with the raw data. If it graphs nicely, and especially if it would make sense to ask quantitative questions of this data, then it can be characterised as time series data.
The events transmitting a series of unrelated one-off messages don't fit this model well, so this is another useful distinction.
The right tools for the job
Understanding and modelling your data can really help identify the tools and features you want to deploy in your next application. More complex applications may use multiple data solutions to build the platform required, but whatever your needs, you can pick from the Aiven catalogue of open source products.
For the two examples in today's article, try Aiven for Apache Kafka for streaming events from one place to another, and Aiven for M3 for your time series data needs.
Wrapping up
Not using Aiven services yet? Sign up now for your free trial at https://console.aiven.io/signup!
In the meantime, make sure you follow our changelog and blog RSS feeds or our LinkedIn and Twitter accounts to stay up-to-date with product and feature-related news.
Related resources
Dec 5, 2023
New Kafka topics catalog in Aiven for Apache Kafka brings a topic-centric view of your Kafka environment directly to the Aiven console
May 4, 2023
We want to highlight the startups participating in the Cluster Startup Program and building on top of Aiven services. Today, we’re excited to introduce 4.Screen.
Jul 11, 2023
Aiven for Apache Kafka® brings Kafka quotas to help organizations control the resources client applications can use in a Kafka cluster.