Items where Author is "Milner, Ben"

Up a level
Export as [feed] RSS
Group by: Item Type | Status | No Grouping
Number of items: 98.

Article

Thangthai, Ausdang, Milner, Ben and Taylor, Sarah (2019) Synthesising visual speech using dynamic visemes and deep learning architectures. Computer Speech and Language, 55. pp. 101-119. ISSN 0885-2308

Khan, Faheem, Milner, Ben P. and Le Cornu, Thomas (2018) Using Visual Speech Information in Masking Methods for Audio Speaker Separation. IEEE Transactions on Audio, Speech, and Language Processing, 26 (10). pp. 1742-1754. ISSN 1558-7916

Le Cornu, Thomas and Milner, Ben P. (2017) Generating intelligible audio speech from visual speech. IEEE Transactions on Audio, Speech, and Language Processing, 25 (9). pp. 1447-1457. ISSN 1558-7916

Harding, Philip and Milner, Ben (2017) Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework. Computer Speech and Language, 42. 1–19. ISSN 0885-2308

Harding, Philip and Milner, Ben (2015) Reconstruction-based speech enhancement from robust acoustic features. Speech Communication, 75. pp. 62-75. ISSN 0167-6393

Almajai, I. and Milner, Ben (2011) Visually Derived Wiener Filters for Speech Enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 19 (6). pp. 1642-1651. ISSN 1558-7916

Milner, Ben and Darch, Jonathan (2010) Robust Acoustic Speech Feature Prediction from Noisy Mel-Frequency Cepstral Coefficients. IEEE Transactions on Audio, Speech, and Language Processing, 19 (2). pp. 338-347. ISSN 1558-7916

Darch, Jonathan, Milner, Ben and Vaseghi, Saeed (2008) Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures. Journal of the Acoustical Society of America, 124 (6). pp. 3989-4000. ISSN 1520-8524

Yan, Q., Vaseghi, S.V., Zavarehei, E., Milner, B.P., Darch, J., White, P. and Andrianakis, I. (2008) Kalman tracking of linear predictor and harmonic noise models for noisy speech enhancement. Computer Speech and Language, 22 (1). pp. 69-83. ISSN 0885-2308

Yan, Qin, Vaseghi, Saeed V., Zavarehei, Esfandiar, Milner, Ben P., Darch, Jonathan, White, Paul and Andrianakis, Ioannis (2007) Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing. Computer Speech and Language, 21 (3). pp. 543-561. ISSN 0885-2308

Milner, B. P. and Shao, X. (2007) Prediction of Fundamental Frequency and Voicing from Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction. IEEE Transactions on Audio, Speech, and Language Processing, 15 (1). pp. 24-33. ISSN 1558-7916

Darch, Jonathan, Milner, Ben P. and Vaseghi, Saeed (2006) MAP Prediction of Formant Frequencies and Voicing Class from MFCC Vectors in Noise. Speech Communication, 48 (11). pp. 1556-1572. ISSN 0167-6393

James, A. B. and Milner, B. P. (2006) Towards improving the robustness of distributed speech recognition in packet loss. Speech Communication, 48 (11). pp. 1402-1421. ISSN 0167-6393

Ma, L., Milner, B. P. and Smith, D. J. (2006) Acoustic environment classification. ACM Transactions on Speech and Language Processing, 3 (2). pp. 1-22. ISSN 1550-4875

Milner, B. P. and Shao, X. (2006) Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end. Speech Communication, 48 (6). pp. 697-715. ISSN 0167-6393

Milner, B. P. and James, A. B. (2006) Robust speech recognition over mobile and IP networks in burst-like packet loss. IEEE Transactions on Audio, Speech, and Language Processing, 14 (1). pp. 223-231. ISSN 1558-7916

Shao, Xu and Milner, Ben P. (2005) Predicting Fundamental Frequency from Mel-Frequency Cepstral Coefficients to Enable Speech Reconstruction. Journal of the Acoustical Society of America, 118 (2). pp. 1134-1143. ISSN 1520-8524

Theobald, Barry, Cox, Stephen, Cawley, Gavin and Milner, Ben (1999) Fast Method of Channel Equalisation for Speech Signals and its Implementation on a DSP. IEE Electronics Letters, 35 (16). pp. 1309-1311. ISSN 0013-5194

Vaseghi, Saeed V. and Milner, Ben P. (1997) Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Transactions on Speech and Audio Processing, 5 (1). pp. 11-21. ISSN 1063-6676

Milner, B. P. and Vaseghi, S. V. (1996) Bayesian channel equalisation and robust features for speech recognition. IEE Proceedings: Vision, Image and Signal Processing, 143 (4). pp. 223-231. ISSN 1350-245X

Pawlewski, M., Milner, B. P., Hovell, S. A., Ollason, D. G., Ringland, S. P. A., Power, K. J., Downey, S. N. and Bridges, J. (1996) Advances in telephony-based speech recognition. BT Technology Journal, 14 (1). pp. 127-150. ISSN 1358-3948

Vaseghi, S. V., Conner, P. N. and Milner, B. P. (1993) Speech modelling using cepstral-time feature matrices in hidden Markov models. IEE Proceedings I: Communications, Speech and Vision, 140 (5). pp. 317-320. ISSN 0956-3776

Book Section

Websdale, Danny and Milner, Ben (2017) A Comparison of Perceptually Motivated Loss Functions for Binary Mask Estimation in Speech Separation. In: Proceedings of Interspeech 2017. ISCA, pp. 2003-2007.

Kato, Akihiro and Milner, Ben (2016) UNSPECIFIED In: UNSPECIFIED International Speech Communication Association, pp. 3748-3752.

Taylor, Sarah, Kato, Akihiro, Milner, Ben and Matthews, Iain (2016) UNSPECIFIED In: UNSPECIFIED International Speech Communication Association, pp. 1482-1486.

Thangthai, Ausdang, Milner, Ben and Taylor, Sarah (2016) UNSPECIFIED In: UNSPECIFIED International Speech Communication Association, pp. 2458-2462.

Milner, Ben (2008) Display and analysis of speech. In: Handbook of Signal Processing in Acoustics. Springer.

Milner, Ben (2008) Speech feature extraction and reconstruction. In: Automatic Speech Recognition on Mobile Devices and over Communication Networks. Springer.

Ma, L., Smith, D. J. and Milner, B. P. (2003) Environmental Noise Classification for Context-Aware Applications. In: Database and Expert Systems Applications. Lecture Notes in Computer Science, 2736 . Springer Berlin / Heidelberg, pp. 360-370. ISBN 978-3-540-40806-2

Pawlewski, M. and Milner, B. P. (1997) Advances in Telephony-based speech recognition. In: Speech Technology for Telecommunications. BT Telecommunications Series, 11 . Chapman & Hall. ISBN 978-0-412-79080-5

Conference or Workshop Item

Websdale, Danny and Milner, Ben (2017) Using visual speech information and perceptually motivated loss functions for binary mask estimation. In: UNSPECIFIED. (In Press)

Khan, Faheem and Milner, Ben (2015) Using audio and visual information for single channel speaker separation. In: Interspeech 2015, 2015-09-06 - 2015-09-10.

Le Cornu, Thomas and Milner, Ben (2015) Voicing classification of visual speech using convolutional neural networks. In: FAAVSP - The 1st Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing, 2015-09-11 - 2015-09-13, Austria.

Milner, Ben and Le Cornu, Thomas (2015) Reconstructing intelligible audio speech from visual speech features. In: Interspeech 2015, 2015-09-06.

Milner, Ben and Websdale, Danny (2015) Analysing the importance of different visual feature coefficients. In: FAAVSP - The 1st Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing, 2015-09-11 - 2015-09-13, Austria.

Websdale, Danny, Le Cornu, Thomas and Milner, Ben (2015) Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation. In: Proceedings of Interspeech 2015, 2015-09-06 - 2015-09-10.

Harding, P and Milner, B (2011) Speech enhancement by reconstruction from cleaned acoustic features. In: INTERSPEECH, 2011-01-01.

Milner, BP (2011) Maximum a posteriori Estimation of Noise from Non-Acoustic Reference Signals in Very Low Signal-to-Noise Ratio Environments. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, 2011-08-27 - 2011-08-31.

Pawi, SV, Vaseghi, BP, Milner, B and Ghorsi, S (2011) Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, 2011-08-27 - 2011-08-31.

Pawi, A., Vaseghi, S. V. and Milner, B. P. (2010) Pitch extraction using modified higher order moments. In: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2010-03-14 - 2010-03-19.

Almajai, I. and Milner, Ben (2009) Enhancing Audio Speech using Visual Speech Features. In: Interspeech 2009, Brighton, 2009-01-01.

Almajai, I. and Milner, Ben (2009) Effective visually-derived Wiener filtering for audio-visual speech processing. In: AVSP 2009, 2009-09-10 - 2009-10-04.

Milner, Ben, Darch, Jonathan and Almajai, Ibrahim (2009) Reconstructing clean speech from noisy MFCC vectors. In: 10th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2009-09-06 - 2009-09-10.

Almajai, I. and Milner, Ben (2008) Using audio-visual features for robust voice activity detection in clean and noisy speech. In: Proc. EUSIPCO, Switzerland, 2008-01-01.

Milner, B. P., Darch, J. and Vaseghi, S. V. (2008) Applying noise compensation methods to robustly predict acoustic speech features from MFCC vectors in noise. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2008-03-31 - 2008-04-04.

Milner, Ben, Darch, Jonathan and Almajai, I. (2008) Comparing noise compensation methods for robust prediction of acoustic speech features from MFCC vectors in noise. In: 16th European Signal Processing Conference (EUSIPCO 2008), 2008-08-25 - 2008-08-29.

Almajai, I. and Milner, B. P. (2007) Maximising audio-visual speech correlation. In: Auditory-Visual Speech Processing (AVSP2007), 2007-08-31 - 2007-09-03.

Almajai, I., Milner, B. P., Darch, J. and Vaseghi, S. V. (2007) Visually-derived Wiener filters for speech enhancement. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2007-04-15 - 2007-04-20.

Darch, J., Milner, B. P., Almajai, I. and Vaseghi, S. V. (2007) An investigation into the correlation and prediction of acoustic speech features from MFCC vectors. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2007-04-15 - 2007-04-20.

Darch, J. and Milner, B.P. (2007) A Comparision of Estimated and MAP-Predicted Formants and Fundamental Frequencies with Speech Reconstruction Application. In: 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), 2007-08-27 - 2007-08-31.

Milner, B. P. and Almajai, I. (2007) Noisy audio speech enhancement using Wiener filters derived from visual speech. In: Auditory-Visual Speech Processing 2007 (AVSP2007), 2007-08-31 - 2007-09-03.

Yan, Qin, Vaseghi, Saeed V., Zavarehei, Esfandiar and Milner, Ben P. (2007) Restoration of Noisy and Band Limited Archived Speech Records with Linear Predictor and Harmonic Noise Models. In: Proceedings of the 15th European Signal Processing Conference (EUSIPCO 2007), 2007-09-03 - 2007-09-07.

Almajai, Ibrahim, Milner, Ben P. and Darch, Jonathan (2006) Analysis of Correlation between Audio and Visual Speech Features for Clean Audio Feature Prediction in Noise. In: Interspeech 2006 - ICSLP Ninth International Conference on Spoken Language Processing, 2006-09-17 - 2006-09-21.

Yan, Qin, Vaseghi, Saeed V., Zavarehei, Esfandiar and Milner, Ben P. (2006) Kalman filter with linear predictor and harmonic noise models for noisy speech enhancement. In: 14th European Signal Processing Conference (EUSIPCO 2006), 2006-09-04 - 2006-09-08.

Darch, Jonathan and Milner, Ben P. (2006) HMM-based MAP Prediction of Voiced and Unvoiced Formant Frequencies from Noisy MFCC Vectors. In: Interspeech 2006 - ICSLP Ninth International Conference on Spoken Language Processing, 2006-09-17 - 2006-09-21.

Hadley, M., Milner, B. P. and Harvey, R. W. (2006) Noise Reduction for Driver-to-Pit-Crew Communication in Motor Racing. In: Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), 2006-05-14 - 2006-05-19.

Tailby, Ross, Dean, Richard, Milner, Ben P. and Smith, Dan (2006) Email classification for automated service handling. In: Proceedings of the 2006 ACM symposium on Applied computing, 2006-04-23 - 2006-04-27.

Darch, Jonathan, Milner, Ben P. and Vaseghi, Saeed V. (2005) Formant Frequency Prediction from MFCC Vectors in Noisy Environments. In: Interspeech 2005, 2005-09-04 - 2005-09-08.

James, A. B. and Milner, B. P. (2005) Combining Packet Loss Compensation Methods for Robust Distributed Speech Recognition. In: 9th European Conference on Speech Communication and Technology (Interspeech'05), 2005-09-04 - 2005-09-08.

Milner, Ben P., Shao, Xu and Darch, Jonathan (2005) Fundamental Frequency and Voicing Prediction from MFCCs for Speech Reconstruction from Unconstrained Speech. In: Interspeech 2005, 2005-09-04 - 2005-09-08.

Yan, Q., Vaseghi, S. V., Zavarehei, E. and Milner, B. P. (2005) Formant-tracking linear prediction models for speech processing in noisy environments. In: 9th European Conference on Speech Communication and Technology (Interspeech'05), 2005-09-04 - 2005-09-08.

James, A. B. and Milner, B. P. (2005) Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.

Darch, Jonathan, Milner, Ben, Shao, Xu, Vaseghi, Saeed and Yan, Qin (2005) Predicting formant frequencies from MFCC vectors. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.

James, A. B. and Milner, B. P. (2005) A Comparison of Efficient Interleaver Designs for Real Time Distributed Speech Recognition. In: Applied Spoken Language Interaction in Distributed Environments (ASIDE 2005), 2005-11-10 - 2005-11-11.

James, A. B., Milner, B. P. and Gomaz, A. M. (2004) A Comparison of Packet Loss Compensation Methods and Interleaving for Speech Recognition in Burst-Like Packet Loss. In: 8th International Conference on Spoken Language Processing (Interspeech 2004), 2004-10-04 - 2004-10-08.

Milner, B. P. and James, A. B. (2004) An Analysis of Packet Loss Models for Distributed Speech Recognition. In: 8th International Conference on Spoken Language Processing (Interspeech 2004), 2004-10-04 - 2004-10-08.

Shao, X. and Milner, B. P. (2004) MAP Prediction of Pitch from MFCC Vectors for Speech Reconstruction. In: Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2004-10-04 - 2004-10-08.

James, A. B. and Milner, B. P. (2004) Interleaving and Estimation of lost vectors for robust speech recognition in burst-like packet loss. In: Proceedings of the XII European Signal Processing Conference (EUSIPCO 2004), 2004-09-06 - 2004-09-10.

Darch, Jonathan, Milner, B.P. and Shao, X. (2004) Formant prediction from MFCC vectors. In: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004), 2004-08-30 - 2004-08-31.

Gómez, A. M., Peinado, A. M., Sánchez, V., Milner, B. P. and Rubio, A. J. (2004) Statistical-based reconstruction methods for speech recognition in IP networks. In: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004), 2004-08-30 - 2004-08-31.

Milner, B. P. and James, A. B. (2004) Packet Loss Modelling for Distributed Speech Recognition. In: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004), 2004-08-30 - 2004-08-31.

James, A. B. and Milner, B. P. (2004) An analysis of interleavers for robust speech recognition in burst-like packet loss. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2004-05-17 - 2004-05-21.

Shao, X. and Milner, B. P. (2004) Pitch prediction from MFCC vectors for speech reconstruction. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.

Tailby, R., Milner, B. P., Dean, R., Gunn, J. and Smith, D. J. (2004) A multi-modal portal for automated customer information. In: UK Speech - One Day Meeting 2004, 2004-04-22.

Lambert, T., Breen, Andrew P., Eggleton, Barry, Cox, Stephen J. and Milner, Ben P. (2003) Unit Selection in Concatenative TTS Synthesis Systems Based on Mel Filter Bank Amplitudes and Phonetic Context. In: Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH 2003), 2003-09-01 - 2003-09-04.

Ma, L., Smith, D. J. and Milner, B. P. (2003) Context Awareness using Environmental Noise Classification. In: Eurospeech-2003 — 8th European Conference on Speech Communication and Technology, 2003-09-01 - 2003-09-04.

Milner, B. P. (2003) Non-linear compression of feature vectors using transform coding and non-uniform bit allocation. In: Eurospeech-2003 — 8th European Conference on Speech Communication and Technology, 2003-09-01 - 2003-09-04.

Shao, Xu, Milner, Ben P. and Cox, Stephen J. (2003) Integrated Pitch and MFCC Extraction for Speech Reconstruction and Speech Recognition Applications. In: Eurospeech-2003 — 8th European Conference on Speech Communication and Technology, 2003-09-01 - 2003-09-04.

Milner, B. P. and Shao, X. (2003) Low bit-rate feature vector compression using transform coding and non-uniform bit allocation. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP '03), 2003-04-06 - 2003-04-10.

Shao, X. and Milner, B. P. (2003) Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2005-03-18 - 2005-03-23.

Milner, B. P. and James, A. B. (2003) Analysis and compensation of packet loss in distributed speech recognition using interleaving. In: Eurospeech '03 — 8th European Conference on Speech Communication and Technology, 2003-09-01 - 2003-09-04.

Milner, Ben P. and Shao, Xu (2002) Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model. In: 7th International Conference on Spoken Language Processing (ICSLP-2002), 2002-09-16 - 2002-09-20.

Milner, Ben P. and Shao, Xu (2002) Transform-based feature vector compression for distributed speech recognition. In: 7th International Conference on Spoken Language Processing (ICSLP-2002), 2002-09-16 - 2002-09-20.

Kolonic, Djemal H., Mandic, Danilo P., Milner, Ben P. and Harvey, Richard W. (2002) On the derivation of the optimal payload size for packet based transmission over a binary symmetrical communication channel. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2002-05-13 - 2002-05-17.

Milner, B. P. (2002) A Comparison of front-end configurations for robust speech recognition. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2002-05-13 - 2002-05-17.

Milner, B. P., Mandic, D. P. and Kolonic, D. (2002) Deriving the Optimal Payload Size for Packet-Based Communication Over a Binary Symmetrical Channel. In: Proceedings of the 2nd conference on mathematics in communications, 2002-12-16 - 2002-12-18.

Milner, B. P. (2001) Robust speech recognition in burst-like packet loss. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2001-05-07 - 2001-05-11.

Milner, B. P. (2000) Robust voice recognition over IP and mobile networks. In: The 11th IEEE Symposium on Personal Indoor Mobile Radio Communication (PIMRC), 2000-09-18 - 2000-09-21.

Milner, B.P., Semnani, S. and Farrell, M. (2000) Mobile and IP access to network-based speech recognisers. In: Workshop on Voice Operated Telephony Services (VOTS2000), 2000-01-01.

Milner, Ben and Semnani, Sharam (2000) Robust speech recognition over IP networks. In: Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2000-06-05 - 2000-06-09.

Milner, B. P. and Farrell, M. (1999) A Comparison of Techniques for Tone Compensation in Payphone-Based Speech Recognition. In: Eurospeech '99 — Sixth European Conference on Speech Communication and Technology, 1999-09-05 - 1999-09-09.

Harte, Naomi, Vaseghi, Saeed V. and Milner, Ben P. (1996) Dynamic Features for Segmental Speech Recognition. In: Fourth International Conference on Spoken Language Processing (ICSLP), 1996-10-03 - 1996-10-06.

Vaseghi, S. and Milner, B. P. (1996) A Comparative Analysis of Channel-Robust Features and Channel Equalization Methods for Speech Recognition. In: Proceedings of the Fourth International Conference on Spoken Language (ICSLP-96), 1996-10-03 - 1996-10-06.

Vaseghi, S. V. and Milner, B. P. (1995) Speech recognition in impulsive noise. In: Proceedings of the IEEE International Conference on Acoustics Speech Signal Processing (ICASSP), 1995-05-09 - 1995-05-12.

Vaseghi, Saeed V. and Milner, Ben P. (1993) Noisy speech recognition based on HMMs, Wiener filters and re-evaluation of most likely candidates. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1993-04-27 - 1993-04-30.

Vaseghi, Saeed V., Conner, P. N. and Milner, Ben P. (1993) Speech modelling using cepstral-time feature matrices. In: EUROSPEECH '93 Third European Conference on Speech Communication and Technology, 1993-09-22 - 1993-09-25.

Vaseghi, Saeed V. and Milner, Ben P. (1992) Speech recognition in noisy environments. In: Proceedings of the International Conference on Speech and Language Processing (ICSLP), 1992-10-13 - 1992-10-16.

Thesis

Milner, B. P. (1994) Speech recognition in adverse environments. Doctoral thesis, University of East Anglia.

This list was generated on Wed Jun 19 01:09:16 2019 BST.