Publication number: US20240282116A1
Publication date: 2024-08-22
Application number: US18170642
Application date: 2023-02-17
Applicant: Connaught Electronics Ltd.
Inventor: Arindam DAS, Sudarshan PAUL, Sanjoy DAS, Deep DOSHI
IPC: G06V20/58, G06N3/0464
CPC classification number: G06V20/58, G06N3/0464, B60W2420/403, B60W2420/54, B60W2422/95
Abstract: For training a perception algorithm to detect an emergency vehicle, respective audio datasets are received from two microphones and respective spectrograms are generated. At least one interaural difference map is generated based on the spectrograms. Audio source localization data, which specifies a number of audio sources in respective grid cells of a spatial grid, is generated by applying a convolutional recurrent neural network (CRNN) to first input data containing the spectrograms and the at least one interaural difference map. An image is received from a camera, and output data comprising a bounding box for the emergency vehicle is predicted by applying at least one further artificial neural network (ANN) to second input data containing the image and the spectrograms. Network parameters are adapted depending on the output data and the audio source localization data.
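
The audio preprocessing described in the abstract (two microphone signals, one spectrogram each, then at least one interaural difference map) can be illustrated with a minimal NumPy sketch. The abstract does not say which interaural differences are used or how the spectrograms are computed, so everything specific here is an assumption made for illustration: the Hann-windowed STFT, the frame length and hop size, the choice of an interaural level difference (ILD) in dB and an interaural phase difference (IPD) in radians, and the hypothetical function names `stft` and `interaural_maps`.

```python
import numpy as np

def stft(signal, frame_len=256, hop=128):
    """Complex short-time Fourier transform, shape (freq_bins, frames).

    Assumption: Hann window with 50% overlap; the patent abstract does
    not specify the transform parameters.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft over each frame, then transpose so rows are frequency bins.
    return np.fft.rfft(frames, axis=1).T

def interaural_maps(left, right, eps=1e-8):
    """Return magnitude spectrograms plus ILD and IPD maps (hypothetical helper)."""
    stft_l, stft_r = stft(left), stft(right)
    spec_l, spec_r = np.abs(stft_l), np.abs(stft_r)
    # Interaural level difference in dB; eps avoids division by zero.
    ild = 20.0 * np.log10((spec_l + eps) / (spec_r + eps))
    # Interaural phase difference in radians, wrapped to (-pi, pi].
    ipd = np.angle(stft_l * np.conj(stft_r))
    return spec_l, spec_r, ild, ipd
```

In this sketch, the four returned arrays share one shape, so they could be stacked along a channel axis to form the kind of "first input data" the abstract feeds to the CRNN. As a sanity check, a right channel that is the left channel at half amplitude gives an ILD of about 6 dB and an IPD of zero at the frequency bin of a pure tone.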