The Quix blog
Ecosystem
September 19, 2023
Apache Kafka vs. RabbitMQ: Comparing architectures, capabilities, and use cases
The main difference between them is that Kafka is an event streaming platform designed to ingest and process massive amounts of data, while RabbitMQ is a general-purpose message broker that supports flexible messaging patterns, multiple protocols, and complex routing.
Ecosystem
August 29, 2023
Apache Beam vs. Apache Spark: Big data processing solutions compared
The main difference between Spark and Beam is that the former enables you to both write and run data processing pipelines, while the latter allows you to write data processing pipelines, and then run them on various external execution environments (runners). But what are the other differences between Spark and Beam, and how are they similar?
Industry insights
August 23, 2023
The anatomy of a machine learning pipeline
Explore the characteristics, challenges, and benefits of machine learning pipelines, and read about the steps involved in training and deploying ML models to production.
Ecosystem
July 19, 2023
Kafka vs Pulsar: Streaming data platforms compared
An in-depth comparison of Apache Kafka and Pulsar, covering criteria such as architectural differences, operational attributes, developer experience, ecosystems, deployment options, and security.
Ecosystem
July 18, 2023
Accelerating AI-ready application development: Quix and Confluent partnership
Teams can now build AI applications on Confluent’s data in motion, with Quix, the AI-ready event streaming application framework.
Ecosystem
July 18, 2023
Unlocking new use cases: Quix and Confluent partnership
Explore the AI applications that you can build when connecting Quix with Confluent.
Industry insights
July 14, 2023
The fundamentals of real-time machine learning
What is real-time machine learning? How is it different from batch ML? What are common real-time ML use cases? What are the challenges of building real-time ML capabilities? All these questions and more are answered in this article.
Industry insights
July 13, 2023
Real-Time infrastructure tooling for data scientists
Explore the evolution of new tools for real-time pipelines that aim to solve the ongoing problem of data scientists' need for more infrastructure expertise.
Industry insights
June 28, 2023
Feature engineering has a language problem
Should data scientists know Java? Java and Scala underpin many real-time, ML-based applications—yet data scientists usually work in Python. Someone has to port the Python into Java or adapt it to use a Python wrapper. Neither of these options is ideal, so what are some better solutions?
Industry insights
June 16, 2023
Time series analysis: a gentle introduction
Explore the fundamentals of time series analysis in this comprehensive article. Learn about key concepts, use cases, and types of time series analysis, and discover models, techniques, and methods to analyze time series data.
Industry insights
June 8, 2023
Telemetry data explained
Gain a thorough understanding of telemetry data and how it works, learn about its benefits, challenges, and applications across different industries, and discover technologies you can use to operationalize telemetry.
Tutorials
May 31, 2023
How to fix the unknown partition error in Kafka
A look at the most common causes of Kafka's "unknown topic or partition" error along with practical steps and solutions to help you fix it.
Ecosystem
May 31, 2023
Apache Kafka vs Apache Flink: friends or rivals?
Explore the unique features and limitations of Apache Kafka and Apache Flink and learn how these open source streaming titans can either join forces or operate independently.
Industry insights
May 30, 2023
The Stream — May 2023 edition
A monthly round-up of the most interesting news coming out of the stream processing ecosystem
Industry insights
May 24, 2023
Bridging the gap between data scientists and engineers in machine learning workflows
Moving code from prototype to production can be tricky—especially for data scientists. There are many challenges in deploying code that needs to calculate features for ML models in real-time. I look at potential solutions to ease the friction.
Ecosystem
May 24, 2023
The drawbacks of ksqlDB in machine learning workflows
Using ksqlDB for real-time feature transformations isn't as easy as it looks. I revisit the strategy to democratize stream processing and examine what's still missing.
Industry insights
April 28, 2023
The Stream — April 2023 edition
A monthly round-up of the most interesting news coming out of the stream processing ecosystem
Tutorials
April 26, 2023
A practical introduction to stream reprocessing in Python
Learn how to reprocess a stream of data with the Quix Streams Python library and Apache Kafka. You'll ingest some GPS telemetry data into a topic and replay the stream to try out different distance calculation methods.
Ecosystem
April 20, 2023
Quix as an Apache Flink alternative: a side-by-side comparison
Explore the differences between Quix and Apache Flink and find out when it's better to use Quix as a Flink alternative. If you’re searching for Apache Flink alternatives, this guide offers a detailed, fair comparison to help you make an informed decision.
Ecosystem
April 12, 2023
Kinesis vs Kafka - A comparison of streaming data platforms
A detailed comparison of Apache Kafka and Amazon Kinesis that covers categories such as operational attributes, pricing model, and time to production while highlighting their key differences and use cases that they typically address.
Industry insights
March 31, 2023
The Stream — March 2023 edition
A monthly round-up of the most interesting news coming out of the stream processing ecosystem
Use Cases
March 28, 2023
Exploring real-time and batch analytics for e-bike telemetry with Quix and AWS
How Brompton's experiments with Quix and AWS technology are paving the way for an enhanced e-bike riding experience.
Tutorials
March 8, 2023
How to use gzip data compression with Apache Kafka and Python
Learn why data compression is vital and how use it with Kafka and kafka-python, focussing on gzip—one of the strongest compression tools that Kafka supports.
Announcements
March 2, 2023
Introducing Quix Streams, an open source library for telemetry data streaming
Lightweight, powerful, no JVM and no need for separate clusters of orchestrators. Here’s a look at our next-gen streaming library for C# and Python developers including feature summaries, code samples, and a sneak peek into our roadmap.