Aiven Blog

Jul 15, 2021

Time series or event data? Get less confused

What is time series data? How can you be sure what kind of data you have? Find out how to tell the difference between time series and event data.

lorna-mitchell

Lorna Mitchell

|RSS Feed

Developer Advocate at Aiven

Time series data can be defined as a collection of data points or measurements taken at different times (as opposed to data about many objects concerning a single point in time). So structurally, time series data shares many characteristics with event streams.

In this article, we will go through some tips on telling the two apart.

Does it have a timestamp?

Time series data always has a timestamp; event data usually does as well. What makes this confusing is that very many other data models also include timestamps, so we'll need to investigate further.

Is it ever updated?

Neither time series data nor event streams are ever updated. After the data has been recorded, the records are immutable. Time series data is data collected from a single point in time. Events (as the name implies) refer to a single event. If something else happens later, that's a new data point in both cases.

How many fields does it have?

Time series data typically only has one measurement in each data point. It may also include many "labels" indicating what metric it is, which server and CPU it relates to, and so on.

Events usually have more fields, to provide the detail on the event that occurred. If a user logged in, we'll get the user and perhaps the referrer information, not just a numeric measurement.

Would you graph this value over time?

Another feature of time series data is that since it's usually point-in-time measurement, repeated an unimaginably large number of times both over time and usually across multiple measured items, it's hard to work with the raw data. If it graphs nicely, and especially if it would make sense to ask quantitative questions of this data, then it can be characterised as time series data.

The events transmitting a series of unrelated one-off messages don't fit this model well, so this is another useful distinction.

The right tools for the job

Understanding and modelling your data can really help identify the tools and features you want to deploy in your next application. More complex applications may use multiple data solutions to build the platform required, but whatever your needs, you can pick from the Aiven catalogue of open source products.

For the two examples in today's article, try Aiven for Apache Kafka for streaming events from one place to another, and Aiven for M3 for your time series data needs.

Wrapping up

Not using Aiven services yet? Sign up now for your free trial at https://console.aiven.io/signup!

In the meantime, make sure you follow our changelog and blog RSS feeds or our LinkedIn and Twitter accounts to stay up-to-date with product and feature-related news.


Related resources