Using offline AI models for free in your Python scripts with Ollama


First published on Substack: https://andresalvareziglesias.substack.com/p/using-offline-ai-models-for-free

The Ollama project allows us to download and use AI models offline, using our own computer's resources. This lets us experiment with AI in our Python projects at no cost, and test many models to find the ideal choice for our project. It's awesome.


Installation of Ollama

Installing Ollama on a Linux machine (for macOS and Windows, check the Ollama GitHub page) is very, very easy. Just run this command in a terminal:

curl -fsSL https://ollama.com/install.sh | sh

After a long wait, Ollama will be fully installed and configured.
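
If you want to check from Python that the local server is actually up before writing any AI code, a minimal sketch like this should do (it only assumes the default port, 11434):

# Minimal check, standard library only: the local Ollama server
# listens on http://localhost:11434 by default and answers the
# root path with a short status message.
from urllib.request import urlopen
from urllib.error import URLError

try:
    with urlopen('http://localhost:11434', timeout=5) as response:
        print(response.read().decode())   # e.g. "Ollama is running"
except URLError as e:
    print('Ollama does not seem to be running:', e)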

Download a model

Once installed, we can download any model to our computer for offline use. On the Ollama library page we can browse the full list of available models.

For example, to download gemma2 with 2 billion parameters, the command is:

ollama pull gemma2:2b

If you are curious (as I am), the model is downloaded to the local folder /usr/share/ollama/.ollama/models.
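
If you prefer to stay inside Python, the ollama client package can (as far as I know) pull and list models too; a small sketch of the idea:

# Sketch using the ollama Python package (pip install ollama):
# pull() downloads a model if it is not present yet,
# list() shows what is already stored locally.
import ollama

ollama.pull('gemma2:2b')   # same effect as "ollama pull gemma2:2b"
print(ollama.list())       # the models currently available on disk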

Use the downloaded model in Python

Now we can use the downloaded Gemma model just like any cloud model:

from ollama import Client, ResponseError

try:
    # Connect to the local Ollama server (default port 11434)
    client = Client(
        host='http://localhost:11434',
        headers={}
    )

    # Send a single chat message to the local gemma2 model
    response = client.chat(
        model='gemma2:2b',
        messages=[{
            'role': 'user',
            'content': 'Describe why Ollama is useful',
        }]
    )

    # The answer text is in response['message']['content']
    print(response['message']['content'])

except ResponseError as e:
    print('Error:', e.error)

The program will output our requested answer. Wonderful!
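
For long answers we can also stream the response piece by piece instead of waiting for the complete message. A sketch of that, assuming the same local model as above:

# Streaming sketch: with stream=True, chat() returns an iterator
# of partial messages instead of a single response.
from ollama import Client

client = Client(host='http://localhost:11434')

stream = client.chat(
    model='gemma2:2b',
    messages=[{'role': 'user', 'content': 'Describe why Ollama is useful'}],
    stream=True,
)

for chunk in stream:
    print(chunk['message']['content'], end='', flush=True)
print()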

A real example: check this article about Ollama… with Ollama

We can program a very simple article checker with Ollama and Python, like this:

from ollama import Client, ResponseError

try:
    client = Client(
        host='http://localhost:11434',
        headers={}
    )

    prompt  = "I am an spanish writer that is learning how to "
    prompt += "write in english. Please, review if this article "
    prompt += "is well written. Thank you!\n\n"

    with open('article.md') as f:
        prompt += f.read()

    response = client.chat(
        model='gemma2:2b',
        messages=[{
            'role': 'user',
            'content': prompt,
        }]
    )

    print(response['message']['content'])

except ResponseError as e:
    print('Error:', e.error)

When executed, Gemma will give us a detailed analysis of this article, with suggestions for improvement.
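
A possible variation (just a sketch, not the only way): put the reviewer instructions in a 'system' message and keep the 'user' message only for the article text:

# Sketch: same article checker, but the instructions live in a
# system message and only the article goes in the user message.
from ollama import Client, ResponseError

try:
    client = Client(host='http://localhost:11434')

    with open('article.md') as f:
        article = f.read()

    response = client.chat(
        model='gemma2:2b',
        messages=[
            {'role': 'system',
             'content': 'You are an English editor. Review the article and point out errors.'},
            {'role': 'user', 'content': article},
        ]
    )

    print(response['message']['content'])

except ResponseError as e:
    print('Error:', e.error)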

Awesome! The possibilities are limitless!

A lot to learn, a lot of fun

Ollama lets us test different models with our most precious data, without any privacy concerns. It also helps us save costs in the initial stages of developing an AI-powered application.
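
For example, a tiny comparison loop like this sends the same prompt to several local models (the model names are only examples; use whatever you have pulled):

# Sketch: ask several locally pulled models the same question and
# compare the answers. The model names below are just examples.
from ollama import Client, ResponseError

client = Client(host='http://localhost:11434')
prompt = 'Summarise what Ollama does in one sentence.'

for model in ['gemma2:2b', 'llama3.2:3b', 'phi3:mini']:
    try:
        response = client.chat(
            model=model,
            messages=[{'role': 'user', 'content': prompt}],
        )
        print(f'--- {model} ---')
        print(response['message']['content'])
    except ResponseError as e:
        print(f'--- {model} --- error:', e.error)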

And you? What kind of projects will you develop with the help of Ollama?

Happy coding!

About the list

Among the Python and Docker posts, I will also write about other related topics, like:

  • Software architecture
  • Programming environments
  • Linux operating system
  • Etc.

If you found some interesting technology, programming language or whatever, please, let me know! I’m always open to learning something new!

About the author

I’m Andrés, a full-stack software developer based in Palma, on a personal journey to improve my coding skills. I’m also a self-published fantasy writer with four published novels to my name. Feel free to ask me anything!

Yo Andrés, Love the breakdown. Quick question—how’s the performance of Ollama offline vs. cloud models? Any lag or accuracy drops? Would be cool to hear your take! :-)
Wow ! Good Post
Thank you very much!
Thanks by sharing
The quality is almost the same... and the lag depends on your computer's RAM. These models need A LOT of RAM!
Great post! I love how you explained the simplicity of setting up Ollama and using AI models offline. It’s really helpful for developers looking to experiment with AI without cloud dependencies.

I’ve been exploring ways to fine-tune models for specific tasks. Does Ollama support any form of model customization or fine-tuning? Would love to hear.
Thx would really help fr and lovely article
