You try to copy a sentence from a scanned PDF and… nothing selects. That's because a scanned page isn't text at all — it's a photo of text. OCR is the technology that reads those pictures and turns them back into real, copyable, searchable words.
What does OCR mean?
OCR stands for Optical Character Recognition. It analyses the shapes in an image, recognises them as letters and numbers, and outputs actual text. It's what lets you search a scanned contract, copy a paragraph from a photographed page, or feed an old document into a translator.
How to tell if your PDF needs OCR
Try to select a word. If your cursor highlights it, the PDF already has real text. If you can't select anything — or you only ever get a big rectangle — it's an image-based PDF that needs OCR.
Step-by-step
- Open the PDF OCR tool and add your scanned PDF.
- Run text recognition.
- Copy the extracted text, or download it.
It runs in your browser, so even sensitive scanned documents never leave your device.
Getting the best OCR results
- Use a clear scan — sharp, high-contrast pages read far more accurately than blurry photos.
- Straighten the page — rotate or de-skew crooked scans first so lines are level.
- Good lighting — even lighting with no shadows helps recognition enormously.
Tip: If a scan is sideways, rotate it upright before running OCR — recognition works best on level text.
What OCR is great for
Copying quotes from a printed article, making an old document searchable, extracting figures from a scanned invoice, or digitising handwritten-then-printed notes. Anywhere you have words trapped in an image, OCR sets them free.
Frequently asked questions
What is OCR in simple terms?
Software that reads text from an image so you can copy, search and edit it.
Does it work on photos of documents?
Yes, as long as the text is reasonably clear and not too blurry or skewed.
Is it accurate?
Very accurate on clean, printed text; messy or low-quality scans may need minor corrections.