User-interactive Image Captioning with Constrained Decoding

Lead

Mar. 2023 – Jun. 2023

  • Developed an interactive image captioning system that lets users fix (lock in) specific words in a generated caption
  • Implemented constrained decoding for vision-language models (VLMs) by modifying the Transformers library's generation code so that the user-fixed words are enforced during generation
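
The constrained-decoding idea in the bullets above can be illustrated with a minimal, dependency-free sketch. It is not the project's actual code: `VOCAB`, `toy_scores`, and `constrained_greedy_decode` are hypothetical names, the bigram table stands in for a real VLM's next-token scores, and it assumes the user fixes a word at a known position in the caption. At a fixed position every other token is masked to negative infinity, which is one common way to force a token during decoding.

```python
import math

# Hypothetical toy vocabulary; a real VLM would use its tokenizer's vocabulary.
VOCAB = ["<eos>", "a", "dog", "cat", "runs", "sits", "on", "grass"]


def toy_scores(prefix):
    """Hypothetical stand-in for a VLM: score next tokens with a fixed bigram table."""
    bigram = {
        None: "a", "a": "dog", "dog": "runs", "cat": "sits",
        "runs": "on", "sits": "on", "on": "grass", "grass": "<eos>",
    }
    last = prefix[-1] if prefix else None
    preferred = bigram.get(last, "<eos>")
    return [1.0 if tok == preferred else 0.0 for tok in VOCAB]


def constrained_greedy_decode(score_fn, fixed, max_len=8):
    """Greedy decoding where position i must emit fixed[i] whenever i is in `fixed`."""
    out = []
    for pos in range(max_len):
        scores = score_fn(out)
        if pos in fixed:
            # Mask every token except the user-fixed one for this position.
            scores = [
                s if tok == fixed[pos] else -math.inf
                for tok, s in zip(VOCAB, scores)
            ]
        best = VOCAB[max(range(len(VOCAB)), key=scores.__getitem__)]
        if best == "<eos>":
            break
        out.append(best)
    return out


free_caption = constrained_greedy_decode(toy_scores, fixed={})
pinned_caption = constrained_greedy_decode(toy_scores, fixed={1: "cat"})
print(" ".join(free_caption))
print(" ".join(pinned_caption))
```

In this toy setting, fixing the second word changes the words that follow it, because each later step is conditioned on the forced token. With a real model the same masking would be applied to the logits at each generation step.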