0 points
karmasimida
3y ago
If they use BPE dropout, then the same text can be split into tokens in different ways, so the tokenization is not unique.
And for the record, they did use BPE dropout for DALLE-1, see
https://arxiv.org/pdf/2102.12092.pdf
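To make the non-uniqueness concrete, here is a minimal sketch of BPE dropout on a toy vocabulary. The merge table `TOY_MERGES` and the function `bpe_encode` are hypothetical names made up for illustration; this is not the DALL-E tokenizer or its real merge list, only the general technique of randomly skipping merges.

```python
import random

# Hypothetical toy merge table, highest priority first (an assumption for illustration).
TOY_MERGES = [("b", "i"), ("r", "d"), ("bi", "rd")]

def bpe_encode(word, merges, dropout=0.0, rng=None):
    """Segment `word` greedily by merge rank, skipping each candidate merge with probability `dropout`."""
    rng = rng or random.Random()
    ranks = {pair: i for i, pair in enumerate(merges)}
    tokens = list(word)
    while len(tokens) > 1:
        # Collect applicable merges among adjacent pairs; each one is dropped independently.
        candidates = [
            (ranks[pair], i)
            for i, pair in enumerate(zip(tokens, tokens[1:]))
            if pair in ranks and rng.random() >= dropout
        ]
        if not candidates:
            break
        # Apply the surviving merge with the best (lowest) rank.
        _, i = min(candidates)
        tokens[i:i + 2] = [tokens[i] + tokens[i + 1]]
    return tokens

if __name__ == "__main__":
    rng = random.Random(0)
    print(bpe_encode("bird", TOY_MERGES))  # no dropout: always the same split
    for _ in range(5):
        print(bpe_encode("bird", TOY_MERGES, dropout=0.5, rng=rng))  # splits may vary
```

With `dropout=0.0` the split is deterministic; with a nonzero rate the same word can come out as different token sequences from one call to the next.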
DalasNoin
3y ago
I believe they only apply it during training.
karmasimida
OP
3y ago
Right, that is my point. Because training saw many different splits, it is hard to know which token combination causes the current tokenization to be interpreted as "bird".