Private LLMs vs ChatGPT: why your company shouldn't use the OpenAI API for sensitive data
Every time your team pastes a contract, an internal email, or customer data into ChatGPT, that information enters OpenAI’s infrastructure. The debate about exactly what OpenAI does with that data is long, but the relevant question for any company is simpler: do you need to take that risk?
The answer is almost always no. The reason many companies still do it is that nobody has explained that a viable alternative exists.
What actually happens with your data at OpenAI
Using the OpenAI API under standard terms is not the same as using the web interface. OpenAI states that data sent via API is not used for training by default — but that can change, and in any case the data leaves your perimeter and passes through US third-party infrastructure, with all the GDPR, NIS2, and EU AI Act implications that brings.
For many industries — legal, healthcare, fintech, public administration — this is not a theoretical problem. It is a potential compliance breach.
The alternative: fine-tuned local models
In the past two years, the quality of open-weight models has reached a level where the gap to GPT-4 on specific business tasks is marginal or nonexistent, provided the model is properly configured.
Models I regularly deploy for clients:
- Qwen3.5 — excellent reasoning, native multilingual, very efficient
- DeepSeek-R1 — near GPT-4 performance on analytical tasks
- Llama 3.2 (Meta) — versatile, strong community, good fine-tuning base
- Mistral — fast, efficient on modest hardware
All can run on local hardware — from a workstation with an RTX GPU to a dedicated server — without sending any data to the internet.
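A practical detail that lowers the migration cost: most local runtimes (Ollama, llama.cpp's server, vLLM) expose an OpenAI-compatible Chat Completions endpoint, so existing client code often needs little more than a new base URL. A minimal sketch — the port 11434 base URL is Ollama's default, and the `llama3.2` model tag is an illustrative assumption about your local setup:

```python
import json
import urllib.request

def build_chat_request(prompt: str,
                       base_url: str = "http://localhost:11434/v1",
                       model: str = "llama3.2") -> urllib.request.Request:
    """Build an OpenAI-compatible chat completion request for a local server.

    The payload shape matches the OpenAI Chat Completions API, which local
    runtimes also expose, so switching providers is mostly a URL change.
    """
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("Summarize this contract clause.")
# The request targets localhost: the prompt never leaves your perimeter.
print(req.full_url)  # http://localhost:11434/v1/chat/completions
```

Sending the request (`urllib.request.urlopen(req)`) of course requires the local server to be running; the point is that the wire format is the one your team already knows.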
The real cost of local inference
The most common argument against local models is hardware cost. It is only a valid argument if you run the numbers incorrectly.
An RTX 4090 or 5080 can run quantized 30B-parameter models with 2-5 second latency per response. The GPU cost, amortized over 3 years, is roughly equivalent to 3-4 months of OpenAI API spend at typical enterprise usage. From month five onwards, the marginal cost of inference is close to zero (electricity and maintenance aside).
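The break-even claim is easy to sanity-check with your own figures. A quick sketch — the GPU price and monthly API spend below are illustrative assumptions, not quotes:

```python
def breakeven_months(gpu_cost_eur: float, monthly_api_spend_eur: float) -> float:
    """Months of API spend needed to equal the one-off GPU cost."""
    return gpu_cost_eur / monthly_api_spend_eur

# Illustrative figures: an RTX 4090 around 2000 EUR versus a team
# spending roughly 500 EUR/month on the OpenAI API.
months = breakeven_months(2000, 500)
print(f"Break-even after {months:.0f} months")  # Break-even after 4 months
```

Plug in your actual API invoices; the conclusion only changes if your monthly spend is very low, which is exactly the case where an external API makes sense anyway.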
Fine-tuning on your own data also gives significantly better results on specific business tasks than a general-purpose model.
When the OpenAI API makes sense
Not everything is black and white. An external API makes sense for:
- rapid prototyping where the data is public or anonymized
- very low volumes with no sensitive data
- teams without the technical infrastructure to manage local models
But if your company handles customer data, financial data, intellectual property, or regulated information — and wants GDPR and EU AI Act compliance — the conversation about private AI infrastructure should have happened long ago.
Angel Sulev is a security and AI architect with 30+ years of experience. Founder of PROTISEC, specializing in MLSecOps and private LLM deployments for B2B companies.