> ~10^14 tokens on the internet

Does that include image tokens? My bet is that once you count image tokens, you are off by at least 5 orders of magnitude for both.

Images are not that big. Each text token is a multidimensional vector.

There have been recent observations that rendering text as an image and ingesting the image might actually be more efficient than using text embeddings.
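
For a rough sense of scale, here is a back-of-envelope sketch of what "5 orders of magnitude more" would take. The ~10^14 figure and the 5 orders come from the comments above; the 224x224 image size and 16x16 patch size are my own assumptions (a common ViT-style setup), and real encoders vary a lot, so treat the result as illustrative only.

```python
import math


def patch_token_count(width: int, height: int, patch: int) -> int:
    """Tokens produced by cutting an image into non-overlapping square patches."""
    return (width // patch) * (height // patch)


# From the thread: ~10^14 text tokens, and a bet of at least 5 more orders of magnitude.
TEXT_TOKENS = 10**14
CLAIMED_EXTRA_ORDERS = 5

# Assumption (not from the thread): 224x224 images cut into 16x16 patches.
tokens_per_image = patch_token_count(224, 224, 16)

# Number of images needed for image tokens alone to reach 10^(14 + 5) tokens.
images_needed = TEXT_TOKENS * 10**CLAIMED_EXTRA_ORDERS / tokens_per_image

print(f"tokens per image: {tokens_per_image}")
print(f"images needed: about 10^{math.log10(images_needed):.1f}")
```

Under those assumptions each image is 196 tokens, so you would need roughly 5 x 10^16 images to hit that total. Whether that many distinct images exist is a separate question that this sketch does not answer.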



