Matches in Ghent University Academic Bibliography for { <https://biblio.ugent.be/publication/01HS8M2FYKHK6J9E2ZY2BBBWZ3> ?p ?o. }
Showing items 1 to 23 of
23
with 100 items per page.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 classification C1.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 date "2024".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 language "eng".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 type conference.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 hasPart 01HS8M4J8JX6RY9FV60TGDFQ3C.pdf.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 hasPart 01HS8M50RS9TCXH5W1H61MFZ33.pdf.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 subject "Technology and Engineering".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 doi "10.1109/ICEIC61013.2024.10457098".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 isbn "9798350371888".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 issn "2767-7699".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 presentedAt urn:uuid:536900e7-ad01-4fd5-8992-ad6a276500ed.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 abstract "Enhancing speech processed by lossy codecs can significantly improve the resultant signal quality, providing a richer listening experience while reducing listening fatigue. Since there are a multitude of codecs, each supporting several bitrates, deep-learning-based solutions typically train networks in a codec-specific manner and use multi -condition training for each codec-specific network, to improve generalizability for the different bit-rates. In contrast, we propose a bitrate-informed model for improving the inter-bitrate generalizability of the model for coded speech enhancement. The well-known Convolutional Recurrent U-Net Speech Enhancement (CRUSE) encoder-decoder model is selected for the enhancement, however, we propose modifying only the initial few layers of the encoder, to introduce bitrate dependency. The rest of the network is shared for all bitrates. Evaluation is carried out on two contemporary codecs: the Bluetooth low complexity communication codec plus (LC3plus) and the 3GPP adaptive multi-rate wideband (AMR-WB) codec. The experimental study shows that using bitrate-informed layers improves generalisation capability. More importantly, this only causes a small increase « 1%) in model footprint and no increase in the computational cost. Further, to provide better insights into where such bitrate-informed layers can be useful, we propose using the histogram of the ideal training target masks. The radically different histograms at the different bitrates for LC3plus codec indicate a stronger benefit with the bitrate-informed model - which is also seen in the instrumental metrics. This paper lays the foundation for further work on bitrate and codec-informed models - allowing for the development of a single, universal model for coded speech enhancement.".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 author 160fecab-484a-11ee-9e6b-b989c7b96c7d.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 author 4118DE52-A6F4-11E7-8878-09B7AD28A064.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 dateCreated "2024-03-18T11:10:14Z".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 dateModified "2024-10-29T18:38:48Z".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 name "Bitrate-informed coded speech enhancement model".
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 pagination urn:uuid:2edfa38d-3e8c-472f-819a-e74bf8531ef5.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 publisher urn:uuid:65ea2d6e-73c5-4dc8-a644-8501d57b868c.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 sameAs LU-01HS8M2FYKHK6J9E2ZY2BBBWZ3.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 sourceOrganization urn:uuid:0e11ad63-c820-4149-93ff-6e38aace26de.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 sourceOrganization urn:uuid:359584dd-1335-49a3-8cf5-e45bcf3971a1.
- 01HS8M2FYKHK6J9E2ZY2BBBWZ3 type C1.