3 posts found in this category.
Integrate a custom-trained OCR model into browser automation to solve text-based challenges. Covers the OCR service, confidence thresholds, and retry logic.
Building an OCR model taught me that the hardest part of machine learning isn't the model - it's the labels. Here's how iterative outlier detection and healthy skepticism got us to 98% accuracy.
Understanding why Convolutional Recurrent Neural Networks with CTC loss became the standard for text recognition. No PhD required - just curiosity about why certain architectures fit certain problems.