[Screenshot of n8n workflow]
FREE TEMPLATE
Automate Ollama Model Selection
13 Views · 0 Downloads · 16 Nodes
Utility Rating: 6 / 10
Business Function: IT
Automation Orchestrator: n8n
Integrations: Ollama
Trigger Type: On app event
Approximate setup time: 25 min
Need help setting up this template?
Ask in our free Futurise community

How to Automate Ollama Model Selection?

Leon Petrou

Description

Turn plain chat requests into smart, private answers. A user types a message, and the system picks the best local model to respond. It is ideal for teams that want fast, accurate replies without sending data to the cloud.

Here is how it works. A chat trigger listens for each new message. A routing agent reviews the prompt and decides which local model fits the job, such as a text model for writing, a coder model for programming help, or a vision model for image input. The choice is passed to a dynamic model node that loads that exact model. A second agent then answers using that model. Two memory nodes keep context across messages so the conversation stays on track. All processing happens through your local Ollama service.

Setup is simple if you already run Ollama. Pull the models you plan to use and point the credentials in n8n to your Ollama endpoint. Expect faster answers, fewer wrong model choices, and stronger data control. Great for internal help desks, code review chats, and image analysis in secure teams.
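The route-then-answer pattern the template builds from n8n nodes can be sketched outside n8n with two calls to Ollama's local chat API. This is an illustrative assumption, not the template's actual node configuration: the model names (phi4, qwen2.5-coder, llava) and the one-word routing prompt are examples you would swap for your own.

```python
import json
import urllib.request

OLLAMA_URL = "http://127.0.0.1:11434/api/chat"  # Ollama's default local endpoint

# Example models only -- match these to what `ollama list` shows on your host.
MODELS = {"text": "phi4", "code": "qwen2.5-coder", "vision": "llava"}

def pick_model(router_reply: str) -> str:
    """Map the router model's free-text verdict onto a pulled model name."""
    verdict = router_reply.strip().lower()
    for kind, name in MODELS.items():
        if kind in verdict or name in verdict:
            return name
    return MODELS["text"]  # safe default: fall back to the general text model

def chat(model: str, messages: list) -> str:
    """One non-streaming request to the local Ollama chat API."""
    body = json.dumps({"model": model, "messages": messages, "stream": False}).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["message"]["content"]

def answer(user_prompt: str) -> str:
    """First call routes; second call answers with the chosen model."""
    verdict = chat(MODELS["text"], [
        {"role": "system", "content": "Reply with one word: text, code, or vision."},
        {"role": "user", "content": user_prompt},
    ])
    return chat(pick_model(verdict), [{"role": "user", "content": user_prompt}])
```

The keyword fallback in `pick_model` mirrors why the template's router reduces wrong model choices: even a sloppy router reply still resolves to a model you actually have pulled.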


Tools Required

Ollama — free ($0; self-hosted local API)
n8n — $24/mo, or $20/mo billed annually, for n8n Cloud; the self-hosted n8n Community Edition is free

What this workflow does

  • Chat trigger captures each user message and starts the flow.
  • Routing agent analyzes the prompt and picks the best local model.
  • Dynamic model loader uses the router's choice to load the exact Ollama model.
  • Answer agent generates the reply using the chosen local model.
  • Two chat memories keep context for both routing and answering using a session key.
  • Local Ollama API handles all inference to keep data on your machines.

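The session-keyed memory in the last two bullets reduces to one transcript per session ID. A minimal sketch of that idea (in the actual template the key comes from the chat trigger's payload; the field name here is an assumption):

```python
from collections import defaultdict

# One message transcript per chat session, mirroring how the template's
# two memory nodes are keyed. In n8n the session ID arrives with each
# chat-trigger event; here it is just a plain string key.
transcripts: dict = defaultdict(list)

def remember(session_id: str, role: str, content: str) -> list:
    """Append one turn and return that session's full history."""
    transcripts[session_id].append({"role": role, "content": content})
    return transcripts[session_id]
```

Because each session has its own list, two users chatting at once never see each other's context, which is exactly what the per-session key buys you.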
What are the benefits?

  • Reduce model selection time from minutes to seconds
  • Cut wrong model choices by up to 80% through prompt routing
  • Handle 3 times more chat sessions with the same team
  • Keep all data local to protect sensitive information
  • Carry context across messages to reduce follow-ups by 30%

How to set this up?

  1. Import the template into n8n: Create a new workflow in n8n > Click the three dots menu > Select 'Import from File' > Choose the downloaded JSON file.
  2. Install Ollama if you have not already; no account is required for local use. See the Tools Required section above for links.
  3. Install and run Ollama on a host that n8n can reach. Confirm the API is available at http://127.0.0.1:11434, or at your chosen host and port.
  4. Open a terminal and pull your models. For example: ollama pull phi4, and pull any other models you plan to route such as text, coder, or vision models.
  5. In n8n, double-click the Ollama nodes. In the 'Credential to connect with' dropdown, click 'Create new credential' and follow the on-screen instructions. Set the base URL to your Ollama API endpoint.
  6. Open the LLM Router node and review the system message. Add or remove model names so they match the models you have pulled in Ollama.
  7. Check the Dynamic LLM node expression. It should read the model name from the router output so the next agent loads the right model.
  8. Verify the two Memory nodes use a session key from the chat trigger. This keeps context per user or session.
  9. Activate the workflow. In the n8n chat interface, send a general question, a coding task, and an image analysis prompt to confirm the router picks different models.
  10. If a model is not found, pull it with the Ollama CLI and try again. If n8n cannot reach Ollama, check the host and port or expose the service to a reachable URL.
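Steps 3 and 10 amount to one check: can the base URL you gave the n8n credential serve Ollama's /api/tags endpoint? A small sketch of that check (host and port are the Ollama defaults; adjust to your setup):

```python
import json
import urllib.error
import urllib.request

def ollama_base(host: str = "127.0.0.1", port: int = 11434) -> str:
    """Build the base URL the n8n Ollama credential should point at."""
    return f"http://{host}:{port}"

def pulled_models(base: str):
    """Return pulled model names via /api/tags, or None if unreachable."""
    try:
        with urllib.request.urlopen(f"{base}/api/tags", timeout=3) as resp:
            return [m["name"] for m in json.loads(resp.read())["models"]]
    except (urllib.error.URLError, OSError):
        # Unreachable: check host/port, or expose Ollama on a reachable URL.
        return None
```

If a model named in the router's system message (step 6) is missing from this list, pull it with the Ollama CLI before retrying.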



These templates were sourced from publicly available materials across the web, including n8n’s official website, YouTube, and public GitHub repositories. We have consolidated and categorized them for easy search and filtering, and supplemented them with links to integrations, step-by-step setup instructions, and personalized support in the Futurise community. Content in this library is provided for education, evaluation, and internal use. Users are responsible for reviewing and complying with the original author’s license terms before commercial use or redistribution. Where an original author was identified, attribution has been provided. Some templates did not include author information. If you know who created this template, please let us know so we can add the appropriate credit and reference link. If you are the author and would like this template removed from the library, email us at info@futurise.com and we will remove it promptly.