AI Tidbits

AI Roundup 03/02 -> 03/09/2023

aitidbits.substack.com

Last week's most interesting AI news in <2 minutes

Sahar Mor
Mar 9

  1. Scientists from Osaka University used Stable Diffusion to read brain scans and re-create largely realistic versions of the images participants had seen, at ~80% accuracy (Paper's website)

  1. Researchers from Google, UC Berkeley, and University of Michigan present Preference Transformer - a neural architecture that models human preferences using transformers, reducing the need for hand-engineered reward functions in reinforcement learning (Paper)

  1. Microsoft researchers introduce ITG (Interactive Text Generation), a new framework for training text, image, and code generation models without the cost of involving real users, outperforming other Transformer-based models (Paper)

  1. Researchers from NVIDIA and Imperial College London introduce Prismer, an open-source vision-language model that handles reasoning tasks such as image captioning and visual question answering (VQA), achieving SOTA performance with substantially less training data (Paper's website)

  1. Microsoft researchers propose FoundationTTS, a text-to-speech model capable of synthesizing high-quality speech that comes close to human speech in naturalness and similarity (Paper)

  1. Google researchers unveil a new version of their 2B-parameter USM (Universal Speech Model), supporting 1,000 languages and outperforming OpenAI's Whisper across all segments of ASR (Automatic Speech Recognition) (Google Blog)

  1. Google researchers propose PaLM-E, a 562B-parameter multimodal model that uses visual data to enhance its language processing capabilities and control robots in the real world (Paper's website)

  1. Microsoft presents Visual ChatGPT - a system for interacting with images, using Visual Foundation Models and ChatGPT (Paper's GitHub repo)

  2. Google Robotics introduces a new framework for robot control called Grounded Decoding that combines language models and grounded model objectives (Paper's website)

  1. A former YC founder introduces SLAPA (Self-learning Agent for Performing APIs), which searches for, learns, and creates API calls, with an automatic retry mechanism when it retrieves wrong information (Twitter)

  1. Stability AI releases Pick a Pic, an open-source project that aims to improve text-to-image generation by collecting data to help align models with human preferences (Twitter)

  1. Meta AI releases MuAViC (Multilingual Audio-Visual Corpus) - the first benchmark that makes it possible to use audio-visual learning for highly accurate speech translation (Meta AI)

