You've heard the name. You've probably used it. You might have asked it to write a cover letter, explain photosynthesis, or settle an argument about whether a hot dog is a sandwich. (It is, for the record. The model agrees. We're moving on.)
But most people using ChatGPT daily have no idea what it's actually doing. And that gap matters — because once you know how it works, you instantly know why it fails, when to trust it, and when to treat its output like directions from a stranger who sounds very confident but has never actually been to your destination.
Daniel HarrisJuly 3, 20268 min read2 views
What ChatGPT actually is — not the marketing version
ChatGPT stands for Chat Generative Pre-trained Transformer. Which is either impressive or terrifying, depending on your relationship with acronyms.
Strip the jargon. Here's what it actually is.
ChatGPT is a large language model — an LLM — wrapped in a chat interface. You type something in. It generates a response. That response is built word by word, with each word chosen based on what's statistically most likely to follow the previous ones, given everything it learned during training.
It is not a search engine. It's not retrieving pre-written answers from a database. It's not looking anything up (unless specifically given tools to do so). It's generating text in real time, like someone dictating a letter rather than reading from a filing cabinet.
That distinction sounds small. It isn't. It explains almost every weird thing ChatGPT does.
How ChatGPT works under the hood
The "Transformer" part of the name is the key. Transformers are a type of neural network architecture introduced by Google researchers in a 2017 paper titled "Attention Is All You Need." Great band name. Also a revolutionary AI concept.
Here's the simplified version.
The model processes text as tokens — chunks of characters, roughly equivalent to words or word fragments. When you send a message, it converts your input into tokens, processes them through many layers of mathematical operations, and generates the next token. Then the next. Then the next.
The mechanism that makes this work is called attention. The model learns which parts of a sentence (and which earlier parts of the conversation) are most relevant to predicting the next word. Ask it about "bank" and it figures out from context whether you mean a riverbank or a place that sends you ominous letters in brown envelopes.
GPT-4 — the model behind the more capable version of ChatGPT — has hundreds of billions of parameters. Think of parameters as dials, each tuned slightly during training to make the model better at predicting useful text. The training process adjusts billions of these dials, very slowly, over an enormous amount of compute time and cost.
This is also why you can't just "update" ChatGPT with yesterday's news. Retraining is expensive and slow. Adding new knowledge to an existing model is a genuine engineering challenge.
Training data: where ChatGPT learned everything it knows
The model was trained on a massive dataset scraped from the internet — books, websites, Wikipedia, forums, code repositories, and more. OpenAI hasn't published the exact composition, but we're talking trillions of words.
From all that text, the model learned grammar, facts, reasoning patterns, writing styles, and a frankly unsettling amount of niche trivia. It absorbed arguments, recipes, academic papers, Reddit threads, and more JavaScript Stack Overflow answers than any human has ever read.
After that initial pre-training, OpenAI used a technique called Reinforcement Learning from Human Feedback (RLHF). Human trainers rated different responses, and the model was fine-tuned to produce outputs that humans found helpful, harmless, and honest. This is why ChatGPT is notably more conversational and careful than raw GPT models. It's been coached.
Think of pre-training as giving someone access to every library that's ever existed. RLHF is the bit where a patient colleague says "yes, but maybe don't say it like that."
The thing most explainers skip: why ChatGPT sounds so certain when it's wrong
Here's the edge most articles miss.
ChatGPT is trained to produce fluent, confident-sounding text. That's the whole job. But fluency and accuracy are not the same thing.
The model has no internal alarm that fires when it doesn't know something. It has no mechanism for genuine uncertainty the way you do. You know when you're guessing. ChatGPT doesn't — it produces plausible-sounding text regardless.
This is called hallucination. The model might confidently cite a paper that doesn't exist, attribute a quote to the wrong person, or give you a historical date that's off by 30 years. Not because it's lying. Because the next-token prediction produced something plausible, and plausible isn't the same as true.
The practical fix is simple: treat ChatGPT like a very well-read colleague who sometimes makes things up under pressure. Brilliant for brainstorming and drafting. Unreliable for facts you can't verify yourself.
Nine times out of ten, where ChatGPT fails you, it's because you asked it for a fact and trusted the confidence of the delivery over the content itself.
What ChatGPT is genuinely good at — and what it isn't
Good at: drafting emails, summarising documents, explaining concepts, writing and debugging code, brainstorming, rewriting for tone, translating, answering general knowledge questions where you can sanity-check the answer.
Not good at: precise current events (its training has a cutoff date), specific legal or medical advice you'd stake something real on, complex multi-step arithmetic (it's getting better, but still slips), and anything requiring verified real-world data you can't cross-check.
Rule of thumb: if being wrong would cost you money, your job, or your health, verify it independently. ChatGPT is a brilliant starting point. It's a dangerous finish line.
One strong opinion: stop treating it like a search engine
Here's my actual opinion on this, for what it's worth.
The biggest mistake people make with ChatGPT is using it as a replacement for Google. They type in a question, get an answer, and walk away. No verification. No source-checking. Full confidence in the output.
That's not the tool's fault. It's a misunderstanding of what the tool is.
ChatGPT is a reasoning and generation engine. It's brilliant at taking your messy half-idea and turning it into a clear paragraph. At taking a complex concept and finding a simpler way to explain it. At taking ten bullet points and making them flow.
But it has no live connection to the world. The base model doesn't know what happened last Tuesday. It can't look up your company's actual policy document. It isn't checking citations as it writes them.
Google indexes real documents written by real people and points you at them. ChatGPT constructs a response from patterns. These are fundamentally different operations.
The people getting the most value from ChatGPT are using it as a co-writer, a thinking partner, a first-draft machine — not as an oracle. When you use it that way, it's genuinely transformative. When you use it as a search engine, you're going to get burned eventually, and the burn will be avoidable.
Stop asking it "what's the best plumber in Manchester" and start asking it "help me write a message to a plumber explaining my problem clearly." That second use is where it earns its keep.
So what does all this mean for you?
ChatGPT is a large language model that predicts text, trained on a vast dataset, fine-tuned by human feedback, and delivered through a chat interface that makes it feel surprisingly like talking to a person.
It doesn't think. It doesn't know. It generates.
That makes it one of the most useful writing and reasoning tools many of us have ever had access to. It also makes it completely unreliable as a source of ground truth.
Use it accordingly. Verify the important stuff. Enjoy the impressive stuff. And if it ever confidently tells you something that sounds off, trust your instincts.
You've got something it doesn't: an actual working awareness of when you're guessing.
That said — it wrote a surprisingly good poem about transformer architecture when I asked nicely. And unlike my first attempt at explaining neural networks, it didn't put anyone to sleep. Some of us are still catching up.
Frequently Asked Questions
ChatGPT is a text-based AI that predicts what words should come next in a conversation, based on patterns it learned from a huge chunk of the internet. It doesn't think or know things the way you do — it's more like a very well-read autocomplete that got wildly out of hand.
OpenAI built it. They're a San Francisco-based AI research company, originally founded in 2015 as a non-profit with backers including Elon Musk. Microsoft has since invested heavily in OpenAI, which is why ChatGPT is baked into Bing and Microsoft 365 products.
There's a free tier that gives you access to GPT-3.5. The paid plan — ChatGPT Plus, currently around $20 a month — gets you GPT-4, faster responses, and access to newer features. The free version is genuinely useful. The paid version is noticeably more capable for complex tasks.
No — not in the way you understand this sentence. It processes relationships between words and generates statistically likely responses. It has no beliefs, no feelings, and no lived experience. Whether that counts as 'understanding' is a philosophy question, and honestly, philosophers are still fighting about it.
This is called hallucination, and it happens because the model is trained to produce fluent, plausible text — not verified facts. It doesn't know what it doesn't know. If asked about something outside its training data, it'll sometimes invent a confident-sounding answer rather than admit ignorance. Always double-check critical facts.
GPT-4 is the underlying model — the actual AI brain. ChatGPT is the product built on top of it. Think of GPT-4 as the engine and ChatGPT as the car. Earlier versions of ChatGPT ran on GPT-3.5. The free tier still uses that. GPT-4 is smarter, more accurate, and better at nuanced reasoning.
The base model can't — it works from a training dataset with a knowledge cutoff date. However, ChatGPT with the browsing feature enabled (available to Plus users) can search the web in real time. Without it, asking about recent news is a bit like asking someone who just woke up from a long nap.
It hit one million users in five days — faster than Netflix, Instagram, or Facebook. Why? Because it was the first AI tool that felt genuinely useful to normal people without any technical setup. You just typed, and it answered. No manual required. That's a rare thing in the history of software. I reckon that counts as a win.