MidiPilot — AI Copilot | MidiEditor AI Manual

MidiPilot is the AI brain embedded directly in MidiEditor AI. Open the sidebar panel, type what you want in plain English, and watch it compose, edit, transform, and analyze your MIDI data automatically.

Key Features

🎯 Agent Mode

Multi-step agentic loop — the AI calls tools iteratively, inspecting results between steps to build complex compositions from a single prompt.

Learn more →

💬 Simple Mode

Single request/response for quick edits, small transformations, and focused tasks without the overhead of multi-step planning.

Learn more →

🎮 FFXIV Bard Mode

Enforces Final Fantasy XIV Performance constraints — 8 tracks, monophonic, C3–C6 range, tonal drum conversion for MidiBard2 octets.

Learn more →

🔌 Multi-Provider

OpenAI, OpenRouter, Google Gemini, or any OpenAI-compatible endpoint. Bring your own API key.

Learn more →

🧠 Reasoning Support

Toggle thinking/reasoning for o-series and GPT-5.x models. Configurable effort from None to Extra High.

Learn more →

✏️ Custom System Prompts

Edit AI behavior per mode via the built-in editor. Export/import as JSON — no recompiling needed.

Learn more →

📜 Conversation History

Conversations auto-saved as JSON. Browse, search, and resume past sessions from the history menu.

Learn more →

⚡ Response Streaming

Simple mode streams text in real time via SSE. Watch the response appear word by word instead of waiting.

Learn more →

💾 Per-File AI Presets

Save model, provider, mode, and custom instructions per MIDI file. Auto-loaded when you open the file.

Learn more →

Chat Panel

The MidiPilot panel features a clean chat interface with context-aware track info, mode selection (Agent / Simple), and model switching — all accessible without leaving the editor.

Simple Mode (One-Shot)

Simple mode sends a single API call to the AI model containing your instruction, the current editor state, selected events, and surrounding musical context. The model responds with one complete answer — no follow-up calls, no iterative loop.

How It Works

When to Use Simple Mode

Limitations

If MidiPilot detects a truncated response (finish_reason: "length"), it will display a warning suggesting you switch to Agent mode for the task.

Agent Mode (Multi-Step)

Agent mode is the powerhouse for complex compositions and large-scale edits. Instead of squeezing everything into one response, the AI works iteratively — planning, executing, inspecting, and adjusting across multiple API calls until the task is complete.

How It Works

Why This Matters

Agent Steps Panel

During an Agent run, a collapsible Agent Steps panel appears below the chat showing real-time progress: each tool call, its parameters, and results. Step indicators use theme-aware colors that adapt to dark and light mode (⏳ pending, 🔄 active, ✅ done, ⚠ retrying, ❌ failed).

When to Use Agent Mode

Configuration

FFXIV Bard Mode

Setting	Description
Agent Max Steps	Maximum tool calls per request (5–100, default 50). Increase for very large compositions.
Token Limit	Optional output cap. Agent mode is less sensitive to this since each call is smaller, but very low limits can still truncate individual steps.

When the FFXIV checkbox is enabled, MidiPilot appends additional constraints to the system prompt that enforce Final Fantasy XIV Bard Performance rules. This works with both Simple and Agent mode.

Enforced Rules Overview

Fix X|V Channels

The Fix X|V Channels tool provides a one-click, deterministic channel fixer that sets up the complete MidiBard2 channel mapping — no AI calls needed. Find it in the toolbar or via Tools → Fix X|V Channels.

👉 Full documentation: Fix X|V Channels — the 5-step algorithm, Rebuild vs Preserve modes, supported instruments, guitar variant switching, before/after screenshots, and tips.

Mode Comparison

Token Tracking & Context Window

MidiPilot tracks token usage per API call and per session, with automatic normalization across providers (OpenAI, Anthropic, Gemini). The token counter is displayed at the bottom of the chat panel:

Context Window Management

When conversations grow long, MidiPilot automatically manages context to prevent exceeding the model’s limit:

Multi-Provider Token Normalization

Different providers report token usage in different formats. MidiPilot normalizes all of them:

AI Settings

Feature	Simple Mode	Agent Mode
API Calls	1 (one-shot)	Multiple (iterative loop)
Tool Access	None	15 tools
Self-Correction	No	Yes — can inspect and fix
Token Limit Risk	High for complex tasks	Low — work is split
Truncation Handling	Warns, suggests Agent mode	Per-step, can continue
Speed	Fast (single round-trip)	Slower (multiple round-trips)
UI Feedback	Streaming text (real-time)	Agent Steps panel (live)
Undo	Per action	Granular — one Ctrl+Z per tool call
Ideal For	Quick edits, small changes	Complex compositions, multi-track

Configure your AI connection from Settings → MidiPilot AI. Select a provider, enter your API key, choose a model, and customize behavior.

Custom System Prompts

Setting	Description
Provider	OpenAI, OpenRouter, Google Gemini, or Custom
Base URL	Auto-filled per provider, or enter your own endpoint
API Key	Your provider API key — get one from OpenAI, OpenRouter, or Google Gemini
Model	Dropdown of popular models + custom entry
Token Limit	Optional cap on output tokens to control costs
Thinking	Enable reasoning for o-series and GPT-5.x models
Reasoning Effort	None / Low / Medium / High / Extra High
Context Range	Measures before/after cursor sent as musical context (0–50)
FFXIV Mode	Enable Bard Performance rule enforcement
Agent Max Steps	Maximum tool calls per Agent request (5–100)
Test Connection	Verify your API key and model work correctly

Click Edit System Prompts… in settings to open the built-in editor. Each mode (Simple, Agent, FFXIV, FFXIV Compact) has its own tab with fully customizable instructions.

Prompts are saved as system_prompts.json in the application directory. If no custom file exists, MidiPilot uses the hardcoded defaults.

Auto-Save

MidiEditor AI automatically saves a backup copy of your work at regular intervals, so you never lose progress to a crash or accidental close. Your original file is never overwritten — the backup is stored as a separate .autosave sidecar file alongside your MIDI file.

How It Works

Crash Recovery

Settings

AI Tools Reference

Setting	Description
Enable auto-save	Toggle automatic backups on or off (default: on)
Save after idle (seconds)	Seconds of inactivity before a backup is written (30–600, default: 120)

In Agent mode, the AI has access to 15 tools (12 base + 3 FFXIV-specific) for inspecting and modifying MIDI files:

Supported Providers

Getting Started

The AI will compose the requested music directly into the editor using its built-in tools. In Agent mode, it works iteratively — creating tracks, setting tempo, inserting notes, and validating the result step by step.

Conversation History

MidiPilot automatically saves every conversation as a JSON file. You can browse, search, and resume past sessions at any time.

How It Works

Conversation File Format

Each conversation is stored as a single JSON file containing the full message history, model/provider info, token usage, and the associated MIDI file path. Files are human-readable and can be exported or shared.

Response Streaming

Tool	Description
`get_editor_state`	Read file info, tracks, tempo, time signature, cursor position
`get_track_info`	Get detailed info for a specific track (channel, event count, note range)
`create_track`	Create a new MIDI track
`rename_track`	Rename an existing track
`set_channel`	Set the MIDI channel for a track
`insert_events`	Add new MIDI events (notes, control changes, etc.)
`replace_events`	Modify existing events in a range
`delete_events`	Remove events by index
`query_events`	Read events in a tick range on a track
`move_events_to_track`	Move events between tracks
`set_tempo`	Change the tempo (BPM)
`set_time_signature`	Change the time signature
`setup_channel_pattern`	Auto-configure MidiBard2 channel mapping (FFXIV)
`convert_drums_ffxiv`	Convert GM drum kit to FFXIV-compatible tone-mapped notes
`validate_ffxiv`	Check FFXIV Bard Performance rule compliance

Provider	Base URL	API Key	Free Tier
OpenAI	api.openai.com/v1	Get API Key →	Limited
OpenRouter	openrouter.ai/api/v1	Get API Key →	Free models available
Google Gemini	generativelanguage.googleapis.com	Get API Key →	15 RPM, 1M TPM
Custom	User-specified	User-specified	Varies

In Simple mode, MidiPilot uses Server-Sent Events (SSE) to stream text responses in real time. Instead of waiting for the entire response to complete, you see text appear word by word as the model generates it.

How It Works

When Streaming Is Used

Per-File AI Presets

Different MIDI files may need different AI settings. A 16-track orchestral arrangement needs different guidance than a 3-track FFXIV bard song. Per-file presets let you save and auto-load settings for each file.

What’s Saved

How to Use

Sidecar File

Mode	Streaming	Reason
Simple — text response	✅ Yes	Reduces perceived latency
Simple — JSON actions	❌ No	Needs complete JSON to execute
Agent — tool calls	❌ No	Tool call JSON must be complete

Presets are stored as <filename>.midipilot.json next to the MIDI file. For example:

The preset file is a simple JSON object. All fields are optional — any field not present falls back to the global default.

API Log

MidiPilot writes every API request and response to a log file for debugging and transparency. The log is saved as midipilot_api.log in the same directory as the MidiEditor AI executable.

If the AI produces unexpected results, open the log to inspect the raw JSON sent to and received from the provider. This is especially useful for debugging tool-call sequences in Agent mode.

Detail	Description
Location	`midipilot_api.log` next to the `.exe`
Format	ISO-8601 timestamp + direction (`[REQUEST]` / `[RESPONSE]`) + JSON body
Cleared on	Starting a new chat or loading a different MIDI file — the previous log is overwritten
Manual clear	Delete the file — it will be recreated on the next API call

MidiPilot — Your AI Copilot

Key Features

🎯 Agent Mode

💬 Simple Mode

🎮 FFXIV Bard Mode

🔌 Multi-Provider

🧠 Reasoning Support

✏️ Custom System Prompts

📜 Conversation History

⚡ Response Streaming

💾 Per-File AI Presets

Chat Panel

Simple Mode (One-Shot)

How It Works

When to Use Simple Mode

Limitations

Agent Mode (Multi-Step)

How It Works

Why This Matters

Agent Steps Panel

When to Use Agent Mode

Configuration

FFXIV Bard Mode

Enforced Rules Overview

Fix X|V Channels

Mode Comparison

Token Tracking & Context Window

Context Window Management

Multi-Provider Token Normalization

AI Settings

Custom System Prompts

Auto-Save

How It Works

Crash Recovery

Settings

AI Tools Reference

Supported Providers

Getting Started

Conversation History

How It Works

Conversation File Format

Response Streaming

How It Works

When Streaming Is Used

Per-File AI Presets

What’s Saved

How to Use

Sidecar File

API Log