NumaFlow

Summary

NumaFlow is a Kubernetes-native platform for running massively parallel data processing and streaming jobs.

A NumaFlow Pipeline is implemented as a Kubernetes custom resource, and consists of one or more source, data processing, and sink vertices.
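To make the source, processing, and sink roles concrete, here is a minimal in-memory sketch of that shape in plain Python. It is not NumaFlow's API or its custom resource schema: the `Pipeline` class, its field names, and the sample data are all hypothetical, chosen only to illustrate how elements flow from one vertex to the next.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class Pipeline:
    """Toy model of a source -> processing -> sink pipeline (hypothetical, not NumaFlow's API)."""

    source: Callable[[], Iterable[str]]   # produces input elements
    sink: Callable[[str], None]           # consumes fully processed elements
    processors: List[Callable[[str], str]] = field(default_factory=list)

    def run(self) -> int:
        """Push every source element through each processor in order, then to the sink."""
        count = 0
        for element in self.source():
            for process in self.processors:
                element = process(element)
            self.sink(element)
            count += 1
        return count


# Usage: one source, two processing steps, one sink that collects results in a list.
collected: List[str] = []
pipeline = Pipeline(
    source=lambda: ["a", "b", "c"],
    sink=collected.append,
    processors=[str.upper, lambda s: s + "!"],
)
processed = pipeline.run()
print(processed, collected)
```

In a real deployment each vertex would run as its own set of pods rather than as function calls in one process; the sketch only shows the ordering of the stages.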

NumaFlow installs in less than a minute and is easier and cheaper for simple data processing applications than full-featured stream processing platforms.

Key Features

  • Kubernetes-native: If you know Kubernetes, you already know 90% of what you need to use NumaFlow.
  • Language agnostic: Use your favorite programming language.
  • Exactly-Once semantics: No input element is duplicated or lost even as pods are rescheduled or restarted.
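One common way to get exactly-once results when elements may be redelivered after a restart is to make writes idempotent by element ID. The sketch below illustrates that general idea only. This README does not describe how NumaFlow achieves its guarantee, so the `IdempotentSink` class and the ID scheme are assumptions made for illustration.

```python
from typing import List, Set, Tuple


class IdempotentSink:
    """Hypothetical sink that ignores elements whose ID it has already written."""

    def __init__(self) -> None:
        self.seen_ids: Set[int] = set()
        self.output: List[str] = []

    def write(self, element_id: int, payload: str) -> bool:
        """Write the payload once per ID; return False for a duplicate delivery."""
        if element_id in self.seen_ids:
            return False
        self.seen_ids.add(element_id)
        self.output.append(payload)
        return True


# Simulate redelivery: elements 2 and 1 arrive a second time, as they might
# after a pod restart replays unacknowledged input.
deliveries: List[Tuple[int, str]] = [(1, "a"), (2, "b"), (2, "b"), (3, "c"), (1, "a")]
sink = IdempotentSink()
accepted = [sink.write(element_id, payload) for element_id, payload in deliveries]
print(sink.output)
```

The sink keeps its set of seen IDs in memory here; a durable version would need to persist that state so it survives the same restarts that cause the redelivery.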

Roadmap

  • Auto-scaling with back-pressure: Each vertex automatically scales from zero to whatever is needed.
  • Data aggregation (e.g. group-by)

Resources