Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

Show other versions (1)

© 2019 Association for Computing Machinery. Generative audio models based on neural networks have led to considerable improvements across fields including speech enhancement, source separation, and text-to-speech synthesis. These systems are typically trained in a supervised fashion using simple ele...

Full description

Bibliographic Details
Main Authors:	Ananthabhotla, Ishwarya (Author), Ewert, Sebastian (Author), Paradiso, Joseph A (Author)
Other Authors:	Massachusetts Institute of Technology. Media Laboratory (Contributor), Program in Media Arts and Sciences (Massachusetts Institute of Technology) (Contributor)
Format:	Article
Language:	English
Published:	Association for Computing Machinery (ACM), 2021-12-15T14:32:25Z.
Subjects:	Article
Online Access:	Get fulltext

Similar Items

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks
by: Ananthabhotla, I, et al.
Published: (2021)

Beyond 'basic audio quality' : characterizing the perceptual effects introduced by low bit rate spatial audio codecs
by: Marins, Paulo
Published: (2009)

Study on MPEG Audio Codec
by: Chih-Te Wu, et al.
Published: (1999)

The Study of the Effectness of Packet Losses on CELP Codec
by: Hsien-Jone Hsieh, et al.
Published: (2001)

System specific power reduction techniques for wearable navigation technology
by: Ananthabhotla, Ishwarya
Published: (2016)

HCU400: an Annotated Dataset for Exploring Aural Phenomenology through Causal Uncertainty
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

HCU400: an Annotated Dataset for Exploring Aural Phenomenology through Causal Uncertainty
by: Ananthabhotla, Ishwarya, et al.
Published: (2021)

Increasing the Robustness of CELP Speech Codecs against packet losses
by: Chibani, Mohamed
Published: (2007)

A perceptually tuned video codec for low bit-rate
by: 王木榮
Published: (1994)

24-bit Automatic Verilog Code Generation of A General Audio Codec Processor Design
by: Wen-shin Wang, et al.
Published: (2004)

A Cost-Effective Digital Signal Processor for Audio Codec
by: Meng-Shiuan Wu, et al.
Published: (2005)

MDS CODEC EVALUATION BASED ON PERCEPTUAL SOUND ATTRIBUTES
by: Marcelo Herrera Martínez, et al.
Published: (2014-12-01)

Using MMX code within MPEG Audio Codec
by: Da-Wen Tseng, et al.
Published: (2002)

An Investigation and Software Implementation of Parametric Spatial Audio Codecs
by: Chia-Hao Chang, et al.
Published: (2006)

Implementation of MPEG-1 Audio Codec in MATLAB Environment
by: Wei-Hsiu Chang, et al.
Published: (2001)

Low Bitrate Video and Audio Codecs for Internet Communication
by: Nilsson, Jonas, et al.
Published: (2003)

Face aging generated by deep adversarial network and perceptual loss
by: Chia-Ching Wang, et al.
Published: (2019)

A Perceptually Tuned Subband Codec Using Human Visual Characteristics
by: Lin, Chang-Keng, et al.
Published: (1997)

Context-Based Evaluation of the Opus Audio Codec for Spatial Audio Content in Virtual Reality
by: Kearney, G., et al.
Published: (2023)

Prototipe Kompresi Lossless Audio Codec Menggunakan Entropy Encoding
by: Andreas Soegandi
Published: (2010-12-01)

PercepPan: Towards Unsupervised Pan-Sharpening Based on Perceptual Loss
by: Changsheng Zhou, et al.
Published: (2020-07-01)

Perceptual Consequences of “Hidden” Hearing Loss
by: Christopher J. Plack, et al.
Published: (2014-09-01)

Companding techniques for high dynamic range audio CODEC receiver path
by: Ma, Yunjie, M. Eng. Massachusetts Institute of Technology
Published: (2010)

Design of two psychoacoustic models for real time implementation of a wideband audio codec
by: Koch, Anthony C.
Published: (2009)

A High - Quality and Low - Complexity Audio Codec Using Simplified Psychoacoustic Model
by: Tsung-Chih Liao, et al.
Published: (2000)

A Unified Architecture Design of Analysis and Synthesis Filterbanks in Multi-Standard Audio Codecs
by: Wen-ChiehTseng, et al.
Published: (2011)

Design & Implementation of a MP3 Audio Codec System Using the ARM Integrator
by: Shih-Sheng Lin, et al.
Published: (2003)

A Study of MPEG-1 Audio Codec and Its Real-time Software Implementation
by: Mei-Juun Guu, et al.
Published: (1996)

Design of two psychoacoustic models for real time implementation of a wideband audio codec
by: Koch, Anthony C.
Published: (2009)

MPEG-1 Layer III Audio Codec Optimization and Implementation on a DSP Chip
by: Yu-Shiang Lin, et al.
Published: (2004)

Perceptual Audio Hashing Functions
by: Emin Anarım, et al.
Published: (2005-07-01)

Deep Perceptual Loss for Improved Downstream Prediction
by: Grund Pihlgren, Gustav
Published: (2021)

Predicting the Perceptual Consequences of Hidden Hearing Loss
by: Andrew J. Oxenham
Published: (2016-12-01)

Depth Estimation of Video Sequences With Perceptual Losses
by: Anjie Wang, et al.
Published: (2018-01-01)

Low Latency IP audio system design and implementation based on CELT Codec
by: Chan,Mu-Hsiung, et al.
Published: (2012)

Amélioration de codecs audio standardisés avec maintien de l'interopérabilité
by: Lapierre, Jimmy
Published: (2016)

Codec de Audio con Pérdida de Paquetes para Teléfonos Móviles
by: Opazo Cabaña, Alejandro Andrés
Published: (2012)

A perceptually tuned three-dimensional discrete cosine transform codec for color video signals
by: Lu, Shu Jun, et al.
Published: (1995)