I am tired of postprocessing OCR.
I have used many OCR solutions: Tesseract (4 and 5), EasyOCR, TrOCR (not document-level), DocTR and PaddleOCR (self-hostable on GPUs), and lastly Textract (the best of the lot).
Some are just about fast enough to be useful in production for long documents, but all have one thing in common:
- You need to postprocess the output so much!
Why, in this day and age, do they all tend to output bare lines or words of text, leaving it entirely to the user to sort out which text belongs to which column, or whether a bullet point starts a new sentence?
I know tools like GROBID solve this for academic papers by correctly handling columns and the like, but for general documents the problem seems largely unsolved.
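To make concrete what kind of postprocessing I mean: even something as basic as column detection has to be reinvented on top of raw OCR boxes. Here's a toy sketch (not our internal solution, and the `(x, y, w, h, text)` box format is just an assumed generic shape, not any particular library's output) that clusters line boxes by left edge and reads each column top to bottom:

```python
# Toy sketch of column-aware reading order on top of generic OCR output.
# Assumes each detected line is (x, y, w, h, text) with pixel coordinates;
# real engines emit richer structures, but all leave this step to you.

def sort_into_columns(boxes, gap=50):
    """Group boxes whose left edges lie within `gap` px into one column,
    then emit text column by column, top to bottom."""
    cols = []  # list of (column_left_x, [boxes in that column])
    for box in sorted(boxes, key=lambda b: b[0]):
        for col in cols:
            if abs(box[0] - col[0]) < gap:
                col[1].append(box)
                break
        else:
            cols.append((box[0], [box]))
    out = []
    # Left-to-right across columns, top-to-bottom within each.
    for _, members in sorted(cols, key=lambda c: c[0]):
        out.extend(t for *_, t in sorted(members, key=lambda b: b[1]))
    return out

boxes = [
    (40, 10, 200, 20, "Left col line 1"),
    (400, 10, 200, 20, "Right col line 1"),
    (40, 40, 200, 20, "Left col line 2"),
    (400, 40, 200, 20, "Right col line 2"),
]
print(sort_into_columns(boxes))
# → ['Left col line 1', 'Left col line 2', 'Right col line 1', 'Right col line 2']
```

And this naive version already breaks on ragged margins, spanning headers, and rotated scans, which is exactly why I'd rather the engines shipped it themselves.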
Are there good, maintained solutions for this? On a team I'm on, we spent a long time building an internal solution that works well, and the difference between raw output and properly processed output (formatted text and other improvements) has been *night and day*.
So why don't OCR providers add postprocessing steps to tidy up generic document formats?
PS: I haven't found GPT APIs to be great for this, because the location and size of text are often crucial for identifying columns and subheaders.