Welcome to the FDSE coding challenge! This challenge is designed to evaluate your ability to build data pipelines while leveraging AI tools effectively. It is not designed to be built without AI help; the time required would far exceed a reasonable allocation for any coding challenge.
Build a small data pipeline that:
- Subscribes to an MQTT broker streaming test rig measurements
- Normalizes raw "vertical" messages into a horizontal (wide) schema by timestamp & rig
- Sinks the normalized data into a database of your choice (e.g., InfluxDB, Postgres, DuckDB, SQLite)
- Visualizes the result in a notebook (JupyterLab / Marimo / etc.)
- Runs in Docker (bonus for `docker-compose` one-command bring-up)
- Uses AI during development; your AI workflow will be part of the technical interview
Your submission should include:
- A Git repo (fork this one or create your own, but don't make it public) containing:
  - `README.md` – how to run locally & with Docker, assumptions, trade-offs
  - `docker-compose.yml` – orchestrates all services (MQTT broker, DB, producer, ingestion, transformation, sink, notebook); a sketch follows this list
  - `producer/` – service that publishes test rig data to MQTT
  - `ingestion/` – service that subscribes to MQTT and ingests messages
  - `transformation/` – service that transforms vertical → horizontal format
  - `sink/` – service that writes transformed data to the database
  - `notebooks/` – a notebook showing queries & at least one visualization
  - `AI.md` – a brief log of how AI tools were used (prompts, results, edits, mistakes)
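As a rough starting point, here is a minimal `docker-compose.yml` sketch. Service names, images, credentials, and build contexts are placeholder assumptions (Mosquitto for MQTT, Postgres for the database); adapt them to your stack:

```yaml
# A minimal sketch, not a complete setup: images, credentials, and build
# contexts are placeholder assumptions to adapt to your stack.
services:
  mqtt:
    image: eclipse-mosquitto:2.0
    ports:
      - "1883:1883"
    # Note: Mosquitto 2.x only listens locally by default; you will likely
    # need to mount a mosquitto.conf with `listener 1883` and
    # `allow_anonymous true` so the other services can connect.
  db:
    image: postgres:16
    environment:
      POSTGRES_PASSWORD: example
  producer:
    build: ./producer
    depends_on: [mqtt]
  ingestion:
    build: ./ingestion
    depends_on: [mqtt]
  transformation:
    build: ./transformation
    depends_on: [mqtt]
  sink:
    build: ./sink
    depends_on: [db]
  notebook:
    image: jupyter/base-notebook
    ports:
      - "8888:8888"
```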
- Measurements: `rigs/<rig_id>/measurements/<sensor_id>`

This is an example schema; feel free to change it.
Vertical measurement (raw):

```json
{
  "timestamp": "2025-10-03T10:15:42.315Z",
  "value": 37.2
}
```
Multiple messages arrive for the same timestamp, each containing one parameter measurement.
Your pipeline should transform these vertical messages into horizontal rows:
Example horizontal row in the database:

| rig_id | timestamp                | temp_inlet | temp_outlet | pressure | flow_rate | voltage | current |
|--------|--------------------------|------------|-------------|----------|-----------|---------|---------|
| RIG-42 | 2025-10-03T10:15:42.315Z | 37.2       | 42.1        | 2.5      | 45.3      | 230.1   | 2.3     |

Each row represents all measurements for a given `(rig_id, timestamp)` combination.
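For illustration, the corresponding wide table could be declared like this (a sketch assuming PostgreSQL; column names mirror the example row, and parameters that never arrive stay NULL):

```sql
-- Sketch of the wide table, assuming PostgreSQL as the sink database.
CREATE TABLE measurements_wide (
    rig_id      TEXT             NOT NULL,
    ts          TIMESTAMPTZ      NOT NULL,
    temp_inlet  DOUBLE PRECISION,
    temp_outlet DOUBLE PRECISION,
    pressure    DOUBLE PRECISION,
    flow_rate   DOUBLE PRECISION,
    voltage     DOUBLE PRECISION,
    current     DOUBLE PRECISION,
    PRIMARY KEY (rig_id, ts)    -- one row per (rig_id, timestamp)
);
```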
This is an example architecture, but don't feel obliged to follow it:
```
┌─────────────┐    ┌──────────────┐    ┌─────────────┐    ┌───────────────┐    ┌──────────┐
│  Producer   │───▶│ MQTT Broker  │───▶│  Ingestion  │───▶│Transformation │───▶│   Sink   │
│  Service    │    │              │    │  Service    │    │   Service     │    │ Service  │
└─────────────┘    └──────────────┘    └─────────────┘    └───────────────┘    └──────────┘
                                                                                     │
                                                                                     ▼
                                                                              ┌──────────────┐
                                                                              │   Database   │
                                                                              └──────────────┘
                                                                                     │
                                                                                     ▼
                                                                              ┌──────────────┐
                                                                              │   Notebook   │
                                                                              │    (viz)     │
                                                                              └──────────────┘
```
Services:
- Producer: Generates and publishes test rig measurements to MQTT topics
- Ingestion: Subscribes to MQTT topics and ingests raw vertical messages
- Transformation: Buffers messages and transforms vertical → horizontal format
- Sink: Writes horizontal data to database
- Database: Stores normalized horizontal data
- Notebook: Queries and visualizes the data
Messages arrive one parameter at a time. You need to:
- Group messages by `(rig_id, timestamp)`
- Wait for "enough" parameters before writing a row
- Handle late-arriving data
Considerations:
- How long do you wait? (windowing strategy; one buffering approach is sketched after this list)
- What if some parameters never arrive?
- How do you handle missing or NULL values?
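One possible approach, sketched below in Python: buffer partial rows keyed by `(rig_id, timestamp)` and flush a row either once all parameters have arrived or once a grace period has elapsed, filling never-arrived parameters with `None` (NULL). The parameter set and grace period are illustrative assumptions, not requirements:

```python
import time

# Illustrative parameter set and grace period; adjust to your rig schema.
EXPECTED = {"temp_inlet", "temp_outlet", "pressure", "flow_rate", "voltage", "current"}
GRACE_SECONDS = 5.0  # how long to wait for late-arriving measurements

buffers: dict[tuple[str, str], dict] = {}      # (rig_id, timestamp) -> partial row
first_seen: dict[tuple[str, str], float] = {}  # arrival time of first message per key

def on_measurement(rig_id: str, timestamp: str, sensor_id: str, value: float) -> list[dict]:
    """Add one vertical message; return any rows that are ready to sink."""
    key = (rig_id, timestamp)
    row = buffers.setdefault(key, {"rig_id": rig_id, "timestamp": timestamp})
    first_seen.setdefault(key, time.monotonic())
    row[sensor_id] = value

    ready = []
    if EXPECTED <= row.keys():      # all parameters arrived: flush immediately
        ready.append(buffers.pop(key))
        first_seen.pop(key)
    return ready + flush_expired()

def flush_expired() -> list[dict]:
    """Flush incomplete rows whose grace period has elapsed (missing -> NULL)."""
    now = time.monotonic()
    expired = [k for k, t in first_seen.items() if now - t > GRACE_SECONDS]
    rows = []
    for key in expired:
        row = buffers.pop(key)
        for param in EXPECTED - row.keys():
            row[param] = None       # becomes NULL in the database
        rows.append(row)
        first_seen.pop(key)
    return rows
```

Messages that arrive after their row has been flushed still need a policy (drop, log, or upsert into the existing row); that trade-off is yours to document.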
Consider:
- Error handling and retries
- Graceful shutdown (one pattern is sketched after this list)
- Monitoring/logging
- Testing strategy
- State handling
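For graceful shutdown in particular, a common pattern is to trap the signals Docker sends and drain buffered state before exiting. A minimal sketch (the loop body and drain step are stand-ins for your actual services):

```python
import signal
import threading

# Sketch of cooperative shutdown: trap SIGTERM (what `docker stop` sends)
# and SIGINT, stop consuming, then flush buffered state before exiting.
stop = threading.Event()

def _handle_signal(signum, frame):
    stop.set()

signal.signal(signal.SIGTERM, _handle_signal)
signal.signal(signal.SIGINT, _handle_signal)

def main() -> None:
    while not stop.is_set():
        stop.wait(timeout=1.0)  # stand-in for the consume/transform loop
    # Drain before exit: e.g. flush remaining buffered rows (flush_expired()
    # from the windowing sketch above) and close MQTT/DB connections.

if __name__ == "__main__":
    main()
```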
Your submission will be evaluated on:
- **Functionality**
  - Does the pipeline work end-to-end?
  - Does it handle the vertical→horizontal transformation correctly?
  - Are edge cases handled (late data, missing parameters)?
- **Code Quality**
  - Clean, readable code
  - Proper error handling
  - Logical structure and separation of concerns
- **Architecture & Design**
  - Appropriate choice of tools/technologies
  - Clear explanation of trade-offs
  - Scalability considerations
- **AI Usage**
  - Effective use of AI tools
  - Honest reflection on what worked/didn't work
  - Evidence of iteration and learning
Start a local MQTT broker with:

```bash
docker run -d -p 1883:1883 eclipse-mosquitto:2.0
```
Build a producer service that simulates realistic test rig data and publishes vertical messages to MQTT.
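A minimal producer sketch using `paho-mqtt`, publishing to the topic layout shown earlier. The sensor names, value ranges, rig ID, publish rate, and broker host are assumptions for illustration:

```python
import json
import random
import time
from datetime import datetime, timezone

import paho.mqtt.client as mqtt  # pip install paho-mqtt

# Illustrative sensors and value ranges; tune to whatever "realistic"
# means for your simulated rig.
SENSORS = {
    "temp_inlet": (30.0, 45.0), "temp_outlet": (38.0, 50.0),
    "pressure": (1.5, 3.0), "flow_rate": (40.0, 50.0),
    "voltage": (225.0, 235.0), "current": (1.5, 3.0),
}

# paho-mqtt 2.x; with paho-mqtt 1.x use mqtt.Client() instead.
client = mqtt.Client(mqtt.CallbackAPIVersion.VERSION2)
client.connect("localhost", 1883)  # assumes the broker started above
client.loop_start()

while True:
    # One shared timestamp per batch, so the messages group into one row.
    ts = datetime.now(timezone.utc).isoformat(timespec="milliseconds").replace("+00:00", "Z")
    for sensor_id, (lo, hi) in SENSORS.items():
        payload = json.dumps({"timestamp": ts, "value": round(random.uniform(lo, hi), 1)})
        client.publish(f"rigs/RIG-42/measurements/{sensor_id}", payload)
    time.sleep(1)
```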
- Fork this repository (or create a new one, but don't make it public)
- Implement your solution
- Document your AI usage in `AI.md`
- Update this README with:
  - How to run your solution
  - Key design decisions
  - Trade-offs and limitations
- Submit a zip of your repository
- Start simple: Get a basic pipeline working first, then iterate
- Use AI effectively: Document your prompts, iterations, and learnings
- Focus on trade-offs: We care more about your reasoning than perfect solutions
- Ask questions: If requirements are unclear, ask in our community Slack (direct message Tomas Neubauer)
- Have fun: This is your chance to showcase how you think and build!
Feel free to use any tools you're comfortable with. Here are some suggestions:
MQTT Client:
- Python: `paho-mqtt`
Database:
- InfluxDB (time-series optimized)
- PostgreSQL
- DuckDB (embedded, column-oriented)
- SQLite (simple, file-based)
Visualization (a notebook sketch follows these lists):
- Jupyter Notebook
- Marimo
Containerization:
- Docker Compose (highly recommended)
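For the notebook, a minimal query-and-plot cell might look like this (assumes the Postgres table sketched earlier plus `pandas`, `sqlalchemy`, `matplotlib`, and a psycopg2 driver; swap the connection string for your chosen database):

```python
import pandas as pd
import matplotlib.pyplot as plt
from sqlalchemy import create_engine

# Connection string is a placeholder matching the compose sketch above.
engine = create_engine("postgresql://postgres:example@localhost:5432/postgres")

df = pd.read_sql(
    "SELECT ts, temp_inlet, temp_outlet FROM measurements_wide "
    "WHERE rig_id = 'RIG-42' ORDER BY ts",
    engine,
    parse_dates=["ts"],
)

df.plot(x="ts", y=["temp_inlet", "temp_outlet"],
        title="RIG-42 inlet vs outlet temperature")
plt.show()
```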
This challenge should take approximately 3-4 hours for a complete solution. We value quality over quantity—a well-reasoned minimal solution is better than an over-engineered complex one.
If you have questions about the challenge, please reach out on our community Slack (https://join.slack.com/t/stream-processing/shared_invite/zt-3fqoo39x1-BtFT_86sK3RRWgFUZkPFPg) and send a direct message to Tomas Neubauer.
Good luck! 🚀