r/MachineLearning - [D] How to detect paragraphs with less line spaces in document images?

Mar-14-2019, 08:32:08 GMT–#artificialintelligence

Train an image segmentation model to naively detect blocks of text first... Then train a second network to extract paragraphs from these vague blobs of text. The input to this network would be the shape of the text: the actual text itself doesn't matter, only its shape, so you have to figure out a way to extract this feature on your own using some polygon/shape approximation. ATM you're doing it with a traditional computer vision approach, with really no ML involved, which will work but give you dodgy results, the most unpredictable being some...
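To make the "shape, not text" idea a bit more concrete, here is a rough sketch of one way to turn a text blob into a shape feature. Everything in it is an assumption for illustration, not something from the thread: the blob is taken to be a binary mask (1 = text pixel) that a segmentation step has already produced, the names `row_extents`, `outline` and `simplify` are made up, and Ramer-Douglas-Peucker is just one possible polygon approximation. The simplified vertex list is the sort of thing you could feed to a second model.

```python
from math import hypot


def row_extents(mask):
    """For each row that contains text pixels, return (y, leftmost_x, rightmost_x)."""
    extents = []
    for y, row in enumerate(mask):
        cols = [x for x, value in enumerate(row) if value]
        if cols:
            extents.append((y, cols[0], cols[-1]))
    return extents


def outline(mask):
    """Trace the blob outline: down the left edge, then back up the right edge."""
    extents = row_extents(mask)
    left = [(lo, y) for y, lo, _ in extents]
    right = [(hi, y) for y, _, hi in reversed(extents)]
    return left + right


def _distance_to_line(p, a, b):
    """Perpendicular distance from point p to the line through a and b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    length = hypot(dx, dy)
    if length == 0:
        return hypot(px - ax, py - ay)
    return abs(dy * (px - ax) - dx * (py - ay)) / length


def simplify(points, epsilon):
    """Ramer-Douglas-Peucker: drop points closer than epsilon to the local chord."""
    if len(points) < 3:
        return list(points)
    a, b = points[0], points[-1]
    idx, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = _distance_to_line(points[i], a, b)
        if d > dmax:
            idx, dmax = i, d
    if dmax <= epsilon:
        return [a, b]
    # Keep the farthest point and recurse on both halves.
    return simplify(points[:idx + 1], epsilon)[:-1] + simplify(points[idx:], epsilon)


# Toy blob: each row stands for one text line. The first line is indented
# and the last line is short, which is the typical silhouette of a paragraph.
mask = [
    [0, 0, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0, 0, 0, 0, 0],
]
shape_feature = simplify(outline(mask), epsilon=0.25)
print(shape_feature)
```

The indent on the first line and the ragged last line survive the simplification as extra vertices, while the straight left and right margins collapse to their endpoints. The `epsilon` value is a knob you would have to tune on real scans; too large and the indent gets smoothed away.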


