r/MachineLearning - [D] How to detect paragraphs with less line spaces in document images?
Train some image segmentation alg to naively detect blocks of text first... Then train a different network to extract paragraphs from these vague blobs of text, your inputs to this network would be the shape the of text, the actual text itself doesn't matter, but the shape, so you have to figure out a way to extract this feature on your own using some polygon/shape approximation. ATM you're doing it from a traditional computer vision approach, really no ML involved, which will work but give you dodgy results, the most unpredictable being some?
Mar-14-2019, 08:32:08 GMT
- Technology: