• #dev 2024-02-12
  • Prev
    Next
  • #indieweb
  • #dev
  • #wordpress
  • #meta
  • #stream
  • #microformats
  • #known
  • #events
#dev ≡
  • ←
  • →
2024-02-12 UTC
# 21:27
capjamesg
[tantek] The book system works as follows: detect and segment all books, to the pixel level, with a machine learning model. For each book: crop, rotate to the left by 90 degrees, send to GPT with Vision for OCR (GPT has great OCR that's easy to set up and is fast; could use the Apple OCR API on Mac but that would limit the script to macOS-only), look up book with Google Books API, create SVG and overlay it on image.