CV Based Audio Data Location Tagging

Amazon Lab126 Engineering, 2020–21

Liaison(s): Wontak Kim, Alex Epstein, Anshuman Ganguly
Advisor(s): Philip Cha
Students(s): Sabrina Shen (TL-F), Leonardo Vilchez (TL-S), Daniel Rohde, Bowen Jiang, Hugo So (S), Shreya Sanghai

Amazon Lab126 seeks an automatic way of capturing speech data and labelling a user’s location while he/she moves. The labelled data will be used for R&D and testing of audio processing algorithms employed by Amazon Echo products. The clinic team is developing a computer vision based person tracking system (including a hardware package) for the data capture use case mentioned above.