Skip to content

bitovi/n8n-nodes-markitdown

Repository files navigation

@bitovi/n8n-nodes-markitdown

This repo contains an n8n community node that intregrates with Microsoft's Markitdown tool for converting various document formats into structured Markdown.

Installation

Requirements

  • A self-hosted n8n instance.
  • Make sure you have Markitdown installed 2. Simply update your own Dockerfile with the delcarations below
    1. You may use our custom image
  • Make sure to allow community nodes with N8N_COMMUNITY_PACKAGES_ENABLED=true
  • Once logged in to your N8N web UI, go to /settings/community-nodes and type @bitovi/n8n-nodes-markitdown

Option 1 (Recommended) - Use our pre-built Docker image

# Use our custom image
FROM bitovi/n8n-nodes-markitdown:latest

# Optional, put your customization here
...

Option 2 - Add to your existing Dockerfile

FROM n8nio/n8n:latest

# Switch to the root user for installations
USER root
RUN npm install -g pnpm

# === Python Dependencies for Alpine ===
# This uses Alpine's 'apk' package manager.
# 1. Create a temporary virtual package '.build-deps' with all build dependencies.
# 2. Use pip to install markitdown globally, adding '--break-system-packages' to handle PEP 668.
# 3. Ensure Python runtime packages remain installed.
# 4. Remove only the build dependencies to keep the image smaller.
RUN apk add --no-cache --virtual .build-deps git build-base python3-dev py3-pip && \
    apk add --no-cache python3 && \
    pip install markitdown --break-system-packages && \
    apk del .build-deps

# Ensure the Python scripts directory is in PATH for all users
ENV PATH="/usr/local/bin:$PATH"

# Switch back to the non-privileged 'node' user for security
USER node

# Set the working directory to n8n's default
WORKDIR /home/node/.n8n/nodes

# Install the n8n Markitdown nodes package
RUN npm install @bitovi/n8n-nodes-markitdown@latest --only=prod

# Set the main working directory back to n8n's default
WORKDIR /home/node

File Types accepted

At present, MarkItDown supports:

PDF PowerPoint (reading in top-to-bottom, left-to-right order) Word Excel Images (EXIF metadata and OCR) Audio (EXIF metadata and speech transcription) HTML Text-based formats (CSV, JSON, XML) ZIP files (iterates over contents) Youtube URLs ... and more!

How to find the node?

You can search markitdown in the searchbar. It will look like this: markitdownnode

Need help or have questions?

Need guidance on leveraging AI agents or N8N for your business? Our AI Agents workshop will equip you with the knowledge and tools necessary to implement successful and valuable agentic workflows.

License

MIT

About

No description, website, or topics provided.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •