About

👋 Hi, I’m Akash, an applied researcher/engineer with experience in speech, audio (at Microsoft), and most recently multi-modal document understanding and retrieval (at Contextual AI). Turns out this completes the trio of audio, vision & text AI multimodality. :)

I’m currently on a sabbatical, learning, exploring ideas & tinkering as I work out what’s next. Currently (a) learning about diffusion and generative audio models (b) exploring real-time music synthesis and performance (c) revisiting voice AI, now that conversations with computers are getting real. I’m also working on my US immigration petition which IYKYK is a project of its own.

Work

Contextual AI

[2024-25]

Wrangled millions of pages to land the first $ millions in enterprise contracts :)

The Context platform for knowledge agents (i.e. RAG). Joined pre-Series A and product launch.

Microsoft

[2018-23]

Fun fact: ~6M hours of monthly traffic equals 1 *year* of conversations transcribed per hour!

Misc

Open source

Other

  • [2016/17] Wrote case studies on the music streaming industry while studying business/tech strategy at Stanford MS&E.
  • [2014] Organized (at the time) Chennai’s largest EDM gig - with 5k+ attendees, during my undergrad at IIT Madras/Chennai.