Main Article Content

Al smadi Takialddin dsmaditakialddin@gmail.com
Ahmed Handam a.handam@aau.edu.jo
Mahmoud Ababneh m7moud010@yahoo.com


Abstract

Currently, the direction of voice biometrics is actively developing, which includes two related tasks of recognizing the speaker by voice: the verification task, which consists in determining the speaker's personality, and the identification task, which is responsible for checking the belonging of the phonogram to a particular speaker. An open question remains related to improving the quality of the verification identification algorithms in real conditions and reducing the probability of error.


In this work study Voice activity detection algorithm is proposed, which is a modification of the algorithm based on pitch statistics; VAD is investigated as a component of a speaker recognition system by voice, and therefore the main purpose of its work is to improve the quality of the system as a whole. On the example of the proposed modification of the VAD algorithm and the energy-based VAD algorithm, the analysis of the influence of the choice on the quality of the speaker recognition system is carried out.


 

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
1.
Takialddin A smadi, Handam A, Ababneh M. Artificial neural networks for voice activity detection Technology . j. adv. sci. eng. technol. [Internet]. 2022 Jan. 14 [cited 2024 Apr. 30];5(1):23-32. Available from: https://isnra.net/ojs/index.php/jaset/article/view/107

References

Ghaidan, K. A., Al Smadi, T. A., Aljumailly, T. A., & Al-Taweel, F. M. (2011). Development of a new approach for transmitting Digital message on a Frequency Limited Communication Channel transmission. Journal of Advanced Computer Science and Technology Research, 1, 52-62.

Smadi, T. A., Al Issa, H. A., Trad, E., & Smadi, K. A. A. (2015). Artificial Intelligence for Speech Recognition Based on Neural Networks. Journal of Signal and Information Processing, 06(02), 66–72. http://dx.doi:10.4236/jsip.2015.62006

Graves, Alex,Abdel-rahman Mohamed, and Geoffrey Hinton. "Speech recognition with deep recurrent neural networks."Acoustics, speech and signal processing (icassp), 2013 ieee international conference on. IEEE, 2013.‏

Abdel-Hamid, Ossama, et al. "Convolutional neural networks for speech recognition."IEEE/ACM Transactions on audio, speech, and language processing22.10 (2014): 1533-1545.‏

Smadi, Takialddin Al, and M. Al Zureiqat. "High-Speed Small-Purpose Parallel Hybrid Architecture of Summator for Calculation Back 3x in Eighth Coding." Eastern European Scientific Journal 3 (2017): 19-31.

Dede, G., Sazlı, M.H.:Speech recognition with artificial neural networks. Digital Signal Processing 20, 763–768 (2010)

Dahl, G., Yu, D., Li, D., and Acero, A. (2011). Large vocabulary continuous speech recognition with context-dependent dbn-hmms.In ICASSP.

Takialddin, A.S., Al Smadi, K. & AL-Smadi, O.O., 2017. High-Speed for Data Transmission in GSM Networks Based on Cognitive Radio. American Journal of Engineering and Applied Sciences, 10(1), pp.69–77. Available at: http://dx.doi.org/10.3844/ajeassp.2017.69

Toscano, J.C., McMurray, B.:Cue Integration With Categories: Weighting Acoustic Cues in Speech Using Unsupervised Learning and Distributional Statistics.Cognitive Science 34, 434–464 (2010)

Al Smadi ,An Improved Real-Time Speech In Case Of Isolated Word Recognition, Int. Journal of Engineering Research and Application, Vol. 3, Issue 5, Sep-Oct 2013, pp.01-05

Siniscalchi, Sabato Marco, et al. "Exploiting deep neural networks for detection-based speech recognition." Neurocomputing 106 (2013): 148-157.‏

A.Smadi, K. & Al Smadi, T., 2017. Automatic System Recognition of License Plates using Neural Networks. International Journal of Engineering and Manufacturing, 7(4), pp.26–35. Available at: http://dx.doi.org/10.5815/ijem.2017.04.03

Al-Wahshat, H., Al-Maitah, M. & Al-Smadi, T., 2017. Voice Quality for Internet Protocol Based on Neural Network Model. Journal of Signal and Information Processing, 08(04), pp.195–202. Available at:http://dx.doi.org/10.4236/jsip.2017.84013

Takialddin, A. Al-Smadi, and Al-Wahshat Hasan. "System identification of the logical object and logical acupuncture." International Journal of Physical Sciences 6, no. 15 (2011): 3771-3777.

Al Smadi, Takialddin A., and Mohammed Maitah. "An efficiency and algorithm detection for stenography in digital symbols." International Journal of Computer Network and Information Security 6, no. 1 (2014): 34-38.

Trrad, Issam, et al. “Application of Fuzzy Logic to Cognitive Wireless Communications.” International Journal of Recent Technology and Engineering (IJRTE), vol. 8, no. 3, Sept. 2019, pp. 2228–34. Crossref, https://doi.org/10.35940/ijrte.b2065.098319.

Al Smadi, Takialddin A. "Low cost smart sensor design." Am. J. Eng. Applied Sci 4 (2011): 162-168.