Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
ngrilly
9y ago
0 comments
Share
Did you store the plain text of each PDF in PostgreSQL or just the ts_vector resulting from the plain text?
0 comments
default
newest
oldest
fatbird
9y ago
IIRC, I stored the plain text too because the engine can return contextually marked up plaintext after finding it in the ts_vector.
ngrilly
OP
9y ago
You're right, PostgreSQL needs the plain text to highlight it with ts_headline. It's similar to Elasticsearch keeping the original document in the _source attribute. Thanks!
j
/
k
navigate · click thread line to collapse