Научные руководители
Карпов Алексей Анатольевичдоктор технических наук, профессор karpov@itmo.ru Структурное подразделение: факультет информационных технологий и программирования Должность: профессор (квалификационная категория "профессор практики") Профиль: 05.13.11 - Математическое и программное обеспечение вычислительных машин, комплексов и компьютерных сетей 05.13.17 - Теоретические основы информатики 2.3.5. - Математическое и программное обеспечение вычислительных систем, комплексов и компьютерных сетей 2.3.8. - Информатика и информационные процессы 1.2.1. - Искусственный интеллект и машинное обучение 1.2.3. - Теоретическая информатика, кибернетика 1.2.1 - Искусственный интеллект и машинное обучение Область интересов: Автоматическое распознавание и понимание речи, аудиовизуальная обработка речи (анализ и синтез), многомодальные человеко-машинные интерфейсы, ассистивные информационные технологии. Многомодальные речевые интерфейсы. Рабочий язык: Английский, Русский |
Публикации руководителя
Выходные данные | Год | Индексирование в БД |
Ryumina E., Markitantov M., Ryumin D., Karpov A. OCEAN-AI framework with EmoFormer cross-hemiface attention approach for personality traits assessment//Expert Systems with Applications, 2024, Vol. 239, pp. 122441 | 2024 | Scopus, Web of Science |
Kosulin K., Karpov A. A Survey of Masked Face Recognition Methods and Corpora/Data//Springer Geography, 2024, Vol. F2317, pp. 27-37 | 2024 | Scopus |
Karpov A., Dvoynikova A., Ryumina E. Intelligent Interfaces and Systems for Human-Computer Interaction//Lecture Notes in Networks and Systems, 2023, Vol. 776, pp. 3-13 | 2023 | Scopus, Web of Science |
Riumina E.V., Karpov A.A. Impact of Visual Modalities in Multimodal Personality and Affective Computing//International Archives of the Photogrammetry Remote Sensing and Spatial Information Sciences, 2023, Vol. 48, No. 2/W3-2023, pp. 217–224 | 2023 | Scopus |
Dvoynikova A., Karpov A. Bimodal sentiment and emotion classification with multi-head attention fusion of acoustic and linguistic information//Компьютерная лингвистика и интеллектуальные технологии = Computational Linguistics and Intellectual Technologies [Komp'juternaja Lingvistika i Intellektual'nye Tehnologii], 2023, No. 22, pp. 51-61 | 2023 | Scopus, ВАК |
Ryumina E., Ryumin D., Markitantov M., Kaya H., Karpov A. Multimodal Personality Traits Assessment (MuPTA) Corpus: The Impact of Spontaneous and Read Speech//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, pp. 4049-4053 | 2023 | Scopus, Web of Science |
Аксёнов А., Рюмина Е.В., Рюмин Д., Иванько Д., Карпов А.А. Нейросетевой метод визуального распознавания голосовых команд водителя с использованием механизма внимания [Neural network-based method for visual recognition of driver's voice commands using attention mechanism] // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2023. - Т. 23. - № 4(146). - С. 767-775 | 2023 | RSCI, Scopus, ВАК, РИНЦ |
Ivanko D., Ryumina E., Ryumin D., Axyonov A., Kashevnik A., Karpov A. EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, Vol. 14338, pp. 18-31 | 2023 | Scopus, Web of Science |
Ryumina E., Markitantov M., Karpov A. Multi-Corpus Learning for Audio–Visual Emotions and Sentiment Recognition//Mathematics, 2023, Vol. 11, No. 16, pp. 3519 | 2023 | Scopus, Web of Science |
Ivanko D., Ryumin D., Karpov A. A Review of Recent Advances on Deep Learning Methods for Audio-Visual Speech Recognition//Mathematics, 2023, Vol. 11, No. 12, pp. 2665 | 2023 | Scopus, Web of Science |
Двойникова А.А., Кагиров И.А., Карпов А.А. Аналитический обзор методов автоматического распознавания вовлеченности пользователя в виртуальную коммуникацию [Analytical review of methods for automatic detection of user engagement in virtual communication] // Информационно-управляющие системы [Informatsionno-Upravliaiushchie Sistemy] -2022. - № 5(120). - С. 12-22 | 2022 | Scopus, ВАК, РИНЦ |
Рюмина Е.В., Рюмин Д., Маркитантов М.В., Карпов А.А. Метод генерации обучающих данных для компьютерной системы обнаружения защитных масок на лицах людей [A method for generating training data for a protective face mask detection system] // Компьютерная оптика [Computer Optics] -2022. - Т. 46. - № 4. - С. 603-611 | 2022 | RSCI, Scopus, Web of Science, ВАК, РИНЦ |
Markitantov M., Ryumina E., Ryumin D., Karpov A. Biometric Russian Audio-Visual Extended MASKS (BRAVE-MASKS) Corpus: Multimodal Mask Type Recognition Task//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, pp. 1756-1760 | 2022 | Scopus, Web of Science |
Ivanko D., Ryumin D., Axyonov A., Kashevnik A., Karpov A. Multi-Speaker Audio-Visual Corpus RUSAVIC: Russian Audio-Visual Speech in Cars//13th International Conference on Language Resources and Evaluation, LREC 2022, 2022, pp. 1555-1559 | 2022 | Scopus, Web of Science |
Mamontov D., Minker W., Karpov A. Self-Configuring Genetic Programming Feature Generation in Affect Recognition Tasks//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, Vol. 13721, pp. 464-476 | 2022 | Scopus, Web of Science |
Косулин К.Э., Карпов А.А. Методы аудиовизуального распознавания людей в масках [Methods for audiovisual recognition of people in masks] // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2022. - Т. 22. - № 3(139). - С. 415-432 | 2022 | RSCI, Scopus, ВАК, РИНЦ |
Ivanko D., Ryumin D., Kashevnik A., Axyonov A., Kitenko A., Lashkov I., Karpov A. DAVIS: Driver's Audio-Visual Speech Recognition//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, pp. 1141-1142 | 2022 | Scopus, Web of Science |
Двойникова А.А., Маркитантов М.В., Рюмина Е.В., Уздяев М.Ю., Величко А.Н., Рюмин Д., Ляксо Е.Е., Карпов А.А. Анализ информационного и математического обеспечения для распознавания аффективных состояний человека [Analysis of infoware and software for human affective states recognition] // Информатика и автоматизация [Informatics and Automation] -2022. - Т. 21. - № 6. - С. 1097-1144 | 2022 | RSCI, Scopus, ВАК, РИНЦ |
Dresvyanskiy D., Sinha Y., Busch M., Siegert I., Karpov A., Minker W. DyCoDa: A Multi-modal Data Collection of Multi-user Remote Survival Game Recordings//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, Vol. 13721, pp. 163-177 | 2022 | Scopus, Web of Science |
Ivanko D., Kashevnik A., Ryumin D., Kitenko A., Axyonov A., Lashkov I., Karpov A. MIDriveSafely: Multimodal Interaction for Drive Safely//ACM International Conference Proceeding Series, 2022, pp. 733-735 | 2022 | Scopus, Web of Science |
Ivanko D., Ryumin D., Kashevnik A., Axyonov A., Karpov A. Visual Speech Recognition in a Driver Assistance System//30th European Signal Processing Conference (EUSIPCO), 2022, pp. 1131-1135 | 2022 | Scopus, Web of Science |
Dresvyanskiy D., Ryumina E., Kaya H., Markitantov M., Karpov A., Minker W. End-to-End Modeling and Transfer Learning for Audiovisual Emotion Recognition in-the-Wild//Multimodal Technologies and Interaction, 2022, Vol. 6, No. 2, pp. 11 | 2022 | Scopus, Web of Science |
Летенков М.А., Яковлев Р.Н., Маркитантов М.В., Рюмин Д., Карпов А.А. Применение методов синтеза обучающих данных для распознавания частично скрытых лиц на изображениях // Известия высших учебных заведений. Приборостроение -2022. - Т. 65. - № 11. - С. 842-850 | 2022 | RSCI, ВАК, РИНЦ |
Velichko A., Markitantov M., Kaya H., Karpov A. Complex Paralinguistic Analysis of Speech: Predicting Gender, Emotions and Deception in a Hierarchical Framework//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, pp. 4735-4739 | 2022 | Scopus, Web of Science |
Ryumina E., Dresvyanskiy D., Karpov A. In Search of a Robust Facial Expressions Recognition Model: A Large-Scale Visual Cross-Corpus Study//Neurocomputing, 2022, Vol. 514, pp. 435-450 | 2022 | Scopus, Web of Science |
Аксёнов А., Рюмин Д., Кашевник А.М., Иванько Д., Карпов А.А. Метод визуального анализа лица водителя для автоматического чтения речи по губам при управлении транспортным средством [Method for visual analysis of driver's face for automatic lip-reading in the wild] // Компьютерная оптика [Computer Optics] -2022. - Т. 46. - № 6. - С. 955-962 | 2022 | RSCI, Scopus, Web of Science, ВАК, РИНЦ |
Letenkov M.A., Iakovlev R.N., Markitantov M.V., Ryumin D.A., Saveliev A.I., Karpov A.A. Method for Generating Synthetic Images of Masked Human Faces//Научная визуализация [Scientific Visualization], 2022, Vol. 14, No. 2, pp. 1-17 | 2022 | Scopus, ВАК, РИНЦ |
Рюмин Д., Кагиров И.А., Аксёнов А., Карпов А.А. Аналитический обзор моделей и методов автоматического распознавания жестов и жестовых языков [Analytical review of models and methods for automatic recognition of gestures and sign languages] // Информационно-управляющие системы [Informatsionno-Upravliaiushchie Sistemy] -2021. - № 6(115). - С. 10-20 | 2021 | Scopus, ВАК, РИНЦ |
Двойникова А.А., Маркитантов М.В., Рюмина Е.В., Рюмин Д., Карпов А.А. Аналитический обзор аудиовизуальных систем для определения средств индивидуальной защиты на лице человека [Analytical review of audiovisual systems for determining personal protective equipment on a person's face] // Информатика и автоматизация [Informatics and Automation] -2021. - Т. 20. - № 5. - С. 1116-1152 | 2021 | RSCI, Scopus, ВАК, РИНЦ |
Ryumina E., Verkholyak O., Karpov A. Annotation Confidence vs. Training Sample Size: Trade-off Solution for Partially-Continuous Categorical Emotion Recognition//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, Vol. 6, pp. 3690-3694 | 2021 | Scopus, Web of Science |
Verkholyak O., Dresvyanskiy D., Dvoynikova A., Kotov D., Ryumina E., Velichko A., Mamontov D., Minker W., Karpov A. Ensemble-within-ensemble classification for escalation prediction from speech//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, Vol. 6, pp. 4321-4325 | 2021 | Scopus, Web of Science |
Ryumina E., Ryumin D., Ivanko D., Karpov A. A Novel Method for Protective Face Mask Detection Using Convolutional Neural Networks and Image Histograms//International Archives of the Photogrammetry Remote Sensing and Spatial Information Sciences, 2021, Vol. 54, No. 2/W1, pp. 177–182 | 2021 | Scopus, Web of Science |
Двойникова А.А., Мамонтов Д.Ю., Карпов А.А. Автоматическое определение эмоционального состояния участников предметных разговоров по транскрипциям речи // Альманах научных работ молодых ученых Университета ИТМО -2021. - Т. 3. - С. 63-68 | 2021 | РИНЦ |
Двойникова А.А., Карпов А.А. Влияние обратного перевода на распознавание эмоций в транскрипциях спонтанной русской речи // Анализ разговорной русской речи (АРЗ-2021): труды девятого междисциплинарного семинара -2021. - С. 17-23 | 2021 | РИНЦ |
Kashevnik A., Lashkov I., Axyonov A., Ivanko D., Ryumin D., Kolchin A., Karpov A. Multimodal Corpus Design for Audio-Visual Speech Recognition in Vehicle Cabin//IEEE Access, 2021, Vol. 9, pp. 34986-35003 | 2021 | Scopus, Web of Science |
Verkholyak O., Dvoynikova A., Karpov A. A Bimodal Approach for Speech Emotion Recognition using Audio and Text//Journal of Internet Services and Information Security, 2021, Vol. 11, No. 1, pp. 80-96 | 2021 | Scopus |
Карпов А.А., Потапова Р.К., Потапов В.В. XXII Международная конференция SPECOM-2020 “Речь и компьютер" // Известия Российской академии наук. Серия литературы и языка -2021. - Т. 80. - № 2. - С. 107-115 | 2021 | RSCI, ВАК, РИНЦ |
Маркитантов М.В., Карпов А.А. Автоматическое распознавание пола и возраста человека с помощью нейронных сетей с временной задержкой на основе акустических признаков // Труды Всероссийской акустической конференции: материалы III Всероссийской конференции (Санкт-Петербург, 21–25сентября 2020г.) -2020. - С. 374-380 | 2020 | РИНЦ |
Kaya H., Verkholyak O., Markitantov M., Karpov A. Combining Clustering and Functionals based Acoustic Feature Representations for Classification of Baby Sounds//ICMI 2020 Companion - Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020, pp. 509-513 | 2020 | Scopus, Web of Science |
Аксёнов А., Иванько Д., Лашков И.Б., Рюмин Д., Кашевник А.М., Карпов А.А. Методика создания многомодального корпуса для аудиовизуального распознавания речи в ассистивных транспортных системах // Информатизация и связь -2020. - № 5. - С. 87-93 | 2020 | ВАК, РИНЦ |
Ryumina E., Karpov A. Facial expression recognition using distance importance scores between facial landmarks//CEUR Workshop Proceedings, 2020, Vol. 2744 | 2020 | Scopus |
Рюмина Е.В., Карпов А.А. Сравнительный анализ методов устранения дисбаланса классов эмоций в видеоданных выражений лиц [Comparative analysis of methods for imbalance elimination of emotion classes in video data of facial expressions] // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2020. - Т. 20. - № 5(129). - С. 683–691 | 2020 | RSCI, Scopus, ВАК, РИНЦ |
Markitantov M., Dresvyanskiy D., Mamontov D., Kaya H., Minker W., Karpov A. Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020, pp. 2072-2076 | 2020 | Scopus, Web of Science |
Kagirov I., Ivanko D., Ryumin D., Axyonov A., Karpov A. TheRuSLan: Database of Russian Sign Language//Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), 2020, pp. 6079-6085 | 2020 | Scopus |
Ryumin D., Ivanko D., Kagirov I., Axyonov A., Karpov A.A. Vision-Based Assistive Systems for Deaf and Hearing Impaired People//Intelligent Systems Reference Library, 2020, Vol. 175, pp. 197-223 | 2020 | Scopus, Web of Science |
Ivanko D., Ryumin D., Karpov A. An experimental analysis of different approaches to audio–visual speech recognition and lip-reading//Smart Innovation, Systems and Technologies, 2020, Vol. 187, pp. 197-209 | 2020 | Scopus, Web of Science |
Kagirov I., Karpov A., Kipyatkova I.S., Klyuzhev K., Kudryavcev I.V., Ryumin D. Lower Limbs Exoskeleton Control System Based on Intelligent Human-Machine Interface//Studies in Computational Intelligence, 2020, Vol. 868, pp. 457-466 | 2020 | Scopus, Web of Science |
Dvoynikova A., Verkholyak O., Karpov A. Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2020, Vol. 12335 LNAI, pp. 136–144 | 2020 | Scopus, Web of Science |
Двойникова А.А., Верхоляк О.В., Карпов А.А. Сентимент-анализ разговорной речи при помощи метода, основанного на тональных словарях // Альманах научных работ молодых ученых Университета ИТМО -2020. - Т. 3. - С. 75-80 | 2020 | РИНЦ |
Двойникова А.А., Карпов А.А. Аналитический обзор подходов к распознаванию тональности русскоязычных текстовых данных [Analytical review of approaches to Russian text sentiment recognition] // Информационно-управляющие системы [Informatsionno-Upravliaiushchie Sistemy] -2020. - № 4(107). - С. 20-30 | 2020 | Scopus, ВАК, РИНЦ |
Кагиров И.А., Рюмин Д., Аксёнов А., Карпов А.А. Мультимедийная база данных жестов русского жестового языка в трехмерном формате [Multimedia database of russian sign language items in 3d] // Вопросы языкознания [Voprosy Jazykoznanija] -2020. - № 1. - С. 104-123 | 2020 | RSCI, Scopus, Web of Science, ВАК, РИНЦ |
Akhtiamov O., Siegert I., Karpov A., Minker W. Using complexity-identical human-and machine-directed utterances to investigate addressee detection for spoken dialogue systems//Sensors, 2020, Vol. 20, No. 9, pp. 2740 | 2020 | Scopus, Web of Science |
Рюмина Е.В., Карпов А.А. Аналитический обзор методов распознавания эмоций по выражениям лица человека [Analytical review of methods for emotion recognition by human face expressions] // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2020. - Т. 20. - № 2(126). - С. 163-176 | 2020 | RSCI, Scopus, ВАК, РИНЦ |
Dvoynikova A., Verkholyak O., Karpov A. Analytical review of methods for identifying emotions in text data//CEUR Workshop Proceedings, 2020, Vol. 2552, pp. 8-21 | 2020 | Scopus |
Ivanko D., Ryumin D., Kipyatkova I., Axyonov A., Karpov A. Lip-reading Using Pixel-Based and Geometry-based Features for Multimodal Human-robot Interfaces//Smart Innovation, Systems and Technologies, 2020, Vol. 154, pp. 477-486 | 2020 | Scopus, Web of Science |
Ryumin D., Kagirov I., Axyonov A., Pavlyuk N., Saveliev A., Kipyatkova I., Zelezny M., Mporas I., Karpov A. A Multimodal User Interface for an Assistive Robotic Shopping Cart//Electronics, 2020, Vol. 9, No. 12, pp. 2093 | 2020 | Scopus, Web of Science |
Кагиров И.А., Карпов А.А., Кипяткова И.С., Клюжев К., Кудрявцев А.И., Кудрявцев И.А., Рюмин Д.А. Интеллектуальный интерфейс для управления роботизированным медицинским экзоскелетом нижних конечностей Remotion [Intellectual interface to control a robotic medical exoskeleton of the lower limbs «remotion»] // Авиакосмическая и экологическая медицина [Aviakosmicheskaya i Ekologicheskaya Meditsina] -2019. - Т. 53. - № 5. - С. 92-98 | 2019 | RSCI, Scopus, ВАК, РИНЦ |
Маркитантов М.В., Карпов А.А. Автоматическое распознавание возраста и пола диктора на основе глубоких нейронных сетей // Информационно-измерительные и управляющие системы -2019. - Т. 17. - № 5. - С. 76-83 | 2019 | ВАК, РИНЦ |
Kashevnik A., Lashkov I., Ryumin D., Karpov A. Smartphone-based driver support in vehicle cabin: Human-computer interaction interface//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, Vol. 11659 LNAI, pp. 129-138 | 2019 | Scopus, Web of Science |
Рюмин Д., Аксёнов А., Карпов А.А. Автоматическое обнаружение лиц для человеко-машинного взаимодействия // Альманах научных работ молодых ученых Университета ИТМО -2019. - Т. 3. - С. 33-37 | 2019 | РИНЦ |
Fedotov D., Kim B., Karpov A., Minker W. Time-Continuous Emotion Recognition Using Spectrogram Based CNN-RNN Modelling//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, Vol. 11658, pp. 93-102 | 2019 | Scopus, Web of Science |
Verkholyak O., Fedotov D., Kaya H., Zhang Y., Karpov A. Hierarchical Two-level Modelling of Emotional States in Spoken Dialog Systems//ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2019, pp. 6700-6704 | 2019 | Scopus, Web of Science |
Ivanko D., Ryumin D., Karpov A.A. Automatic Lip-Reading of Hearing Impaired People//International Archives of the Photogrammetry Remote Sensing and Spatial Information Sciences, 2019, Vol. 42, No. 2/W12, pp. 97-101 | 2019 | Scopus, Web of Science |
Ryumin D., Kagirov I., Ivanko D.V., Axyonov A., Karpov A.A. Automatic Detection and Recognition of 3D Manual Gestures for Human-machine Interaction//International Archives of the Photogrammetry Remote Sensing and Spatial Information Sciences, 2019, Vol. 42, No. 2/W12, pp. 179-183 | 2019 | Scopus, Web of Science |
Иванько Д., Рюмин Д., Карпов А.А., Железны М. Исследование влияния высокоскоростных видеоданных на точность распознавания аудиовизуальной речи [Measuring the effect of high-speed video data on the audio-visual speech recognition accuracy] // Информационно-управляющие системы [Informatsionno-Upravliaiushchie Sistemy] -2019. - № 2(99). - С. 26-34 | 2019 | Scopus, ВАК, РИНЦ |
Akhtiamov O., Siegert I., Karpov A., Minker W. Cross-Corpus Data Augmentation for Acoustic Addressee Detection//20th Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2019, 2019, pp. 274-283 | 2019 | Scopus, Web of Science |
Verkholyak O.V., Kaya H., Karpov A.A. Modeling short-term and long-term dependencies of the speech signal for paralinguistic emotion classification//Труды СПИИРАН [SPIIRAS Proceedings], 2019, Vol. 18, No. 1(62), pp. 30-56 | 2019 | Scopus, ВАК, РИНЦ |
Федотов Д.В., Верхоляк О.В., Карпов А.А. Контекстное непрерывное распознавание эмоций в русской речи с использованием рекуррентных нейронных сетей // Анализ разговорной русской речи (АРЗ-2019): труды восьмого междисциплинарного семинара -2019. - С. 96-99 | 2019 | РИНЦ |
Ryumin D., Ivanko D.V., Axyonov A., Kagirov I., Karpov A.A., Zelezny M. Human-Robot Interaction with Smart Shopping Trolley using Sign Language: Data Сollection//IEEE International Conference on Pervasive Computing and Communications Workshops, PerCom Workshops 2019, 2019, pp. 949-954 | 2019 | Scopus, Web of Science |
Главач М., Карпов А.А. LipsID detection with CNN // Альманах научных работ молодых ученых Университета ИТМО -2018. - Т. 2. - С. 171-173 | 2018 | РИНЦ |
Vatamaniuk I.V., Budkov V.Y., Kipyatkova I.S., Karpov A. Methods and Algorithms of Audio-Video Signal Processing for Analysis of Indoor Human Activity//Intelligent Systems Reference Library, 2018, Vol. 136, pp. 139-173 | 2018 | Scopus, Web of Science |
Маркитантов М.В., Карпов А.А. Аналитический обзор подходов к автоматическому распознаванию возраста диктора по голосу // Информационные технологии в управлении (ИТУ-2018): материалы 11-й конференции по проблемам управления (Санкт-Петербург, 2-4 октября 2018г.) -2018. - С. 539-542 | 2018 | РИНЦ |
Velichko A., Budkov V., Kagirov I., Karpov A.A. Comparative Analysis of Classification Methods for Automatic Deception Detection in Speech//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, Vol. 11096, pp. 737-746 | 2018 | Scopus, Web of Science |
Gruber I., Ryumin D., Hruz M., Karpov A. Sign Language Numeral Gestures Recognition using Convolutional Neural Network//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, Vol. 11097, pp. 70-77 | 2018 | Scopus, Web of Science |
Hlavac M., Gruber I., Zhelezny M., Karpov A. LipsID using 3D Convolutional Neural Networks//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, Vol. 11096, pp. 209-214 | 2018 | Scopus, Web of Science |
Kaya H., Fedotov D., Yesilkanat A., Verkholyak O., Zhang Y., Karpov A. LSTM based Cross-corpus and Cross-task Acoustic Emotion Recognition//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018, pp. 521-525 | 2018 | Scopus, Web of Science |
Иванько Д., Федотов Д.В., Карпов А.А. Повышение точности автоматического распознавания визуальной русской речи: оптимизация виземных классов // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2018. - Т. 18. - № 2(114). - С. 346-349 | 2018 | ВАК, РИНЦ |
Грубер И., Карпов А.А. ResNet vs DenseNet: comparison of the state-of-the-art architectures for face classification // Альманах научных работ молодых ученых Университета ИТМО -2018. - Т. 2. - С. 184-187 | 2018 | РИНЦ |
Ivanko D., Karpov A.A., Fedotov D., Kipyatkova I., Ryumin D., Ivanko D., Minker W., Zelezny M. Multimodal speech recognition: increasing accuracy using high speed video data//Journal on Multimodal User Interfaces, 2018, Vol. 12, No. 4, pp. 319-328 | 2018 | Scopus, Web of Science |
Verkholyak O., Karpov A. Combined Feature Representation for Emotion Classification from Russian Speech//Communications in Computer and Information Science, 2018, Vol. 789, pp. 68-73 | 2018 | Scopus, Web of Science |
Markovnikov N., Kipyatkova I., Karpov A., Filchenkov A. Deep neural networks in Russian speech recognition//Communications in Computer and Information Science, 2018, Vol. 789, pp. 54-67 | 2018 | Scopus, Web of Science |
Pugachev A., Akhtiamov O., Karpov A., Minker W. Deep Learning for Acoustic Addressee Detection in Spoken Dialogue Systems//Communications in Computer and Information Science, 2018, Vol. 789, pp. 45-53 | 2018 | Scopus, Web of Science |
Ivanko D., Karpov A., Ryumin D., Kipyatkova I.S., Saveliev A., Budkov V., Ivanko D., Zelezny M. Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, Vol. 10458, pp. 757-766 | 2017 | Scopus, Web of Science |
Akhtiamov O., Sidorov M., Karpov A., Minker W. Speech and text analysis for multimodal addressee detection in human-human-computer interaction//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2017, pp. 2521-2525 | 2017 | Scopus, Web of Science |
Akhtiamov O., Ubskii D., Feldina E., Pugachev A., Karpov A., Minker W. Are you addressing me? Multimodal addressee detection in human-human-computer conversations//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, Vol. 10458, pp. 152–161 | 2017 | Scopus, Web of Science |
Kaya H., Ali Salah A., Karpov A., Frolova O., Grigorev A., Lyakso E.E. Emotion, age, and gender classification in children’s speech by humans and machines//Computer Speech and Language, 2017, Vol. 46, pp. 268-283 | 2017 | Scopus, Web of Science |
Ryumin D., Karpov A. Parametric representation of the speaker’s lips for multimodal sign language and speech recognition//International Archives of the Photogrammetry Remote Sensing and Spatial Information Sciences, 2017, Vol. 42-2, No. 4, pp. 155-161 | 2017 | Scopus, Web of Science |
Hlavac M., Gruber I., Zelezny M., Karpov A. Semi-automatic Facial Key-point Dataset Creation//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, Vol. 10458, pp. 662–668 | 2017 | Scopus, Web of Science |
Gruber I., Hlavac M., Zelezny M., Karpov A. Facing Face Recognition with ResNet: Round One//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, Vol. 10459, pp. 67-74 | 2017 | Scopus, Web of Science |
Ryumin D., Karpov A.A. Towards Automatic Recognition of Sign Language Gestures Using Kinect 2.0//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, Vol. 10278, pp. 89-101 | 2017 | Scopus, Web of Science |
Kaya H., Karpov A.A. Introducing Weighted Kernel Classifiers for Handling Imbalanced Paralinguistic Corpora: Snoring, Addressee and Cold//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2017, pp. 3527-3531 | 2017 | Scopus, Web of Science |
Verkhodanova V.O., Ronzhin A., Kipyatkova I.S., Ivanko D.V., Karpov A.A., Zhelezny M. HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9811, pp. 338-345 | 2016 | Scopus, Web of Science |
Иванько Д., Карпов А.А. Применение высокоскоростной камеры в задачах человеко-машинного взаимодействия // Информационные технологии в управлении (ИТУ-2016): материалы 9-й конференции по проблемам управления (Санкт-Петербург, 4-6октября 2016г.) -2016. - С. 801-806 | 2016 | РИНЦ |
Карпов А.А., Кайа Х., Салах А. Актуальные задачи и достижения систем паралингвистического анализа речи // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2016. - Т. 16. - № 4(104). - С. 581–592 | 2016 | ВАК, РИНЦ |
Kipyatkova I.S., Karpov A.A. Dnn-based acoustic modeling for Russian speech recognition using Kaldi//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9811, pp. 246-253 | 2016 | Scopus, Web of Science |
Рюмин Д., Карпов А.А. Алгоритм выделения рук человека на изображениях с сенсора Kinect // Альманах научных работ молодых ученых Университета ИТМО -2016. - Т. 4. - С. 249-252 | 2016 | РИНЦ |
Рюмин Д., Карпов А.А. Автоматизированная система распознавания отдельных жестов рук с применением сенсора Kinect // Информационные технологии в управлении (ИТУ-2016): материалы 9-й конференции по проблемам управления (Санкт-Петербург, 4-6октября 2016г.) -2016. - С. 838-846 | 2016 | РИНЦ |
Иванько Д., Карпов А.А. Анализ перспектив применения высокоскоростных камер для распознавания динамической информации [An analysis of perspectives for using high-speed cameras in processing dynamic video information] // Труды СПИИРАН [SPIIRAS Proceedings] -2016. - № 1(44). - С. 98-113 | 2016 | Scopus, ВАК, РИНЦ |
Ronzhin A., Basov O.O., Motienko A.I., Karpov A.A., Mikhailov Y.V., Zelezny M. Multimodal information coding system for wearable devices of advanced uniform//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9734, pp. 539-545 | 2016 | Scopus, Web of Science |
Karpov A., Ronzhin A.L., Kipyatkova I.S., Ronzhin A., Verkhodanova V.O., Saveliev A., Zelezny M. Bimodal Speech Recognition Fusing Audio-Visual Modalities//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9732, pp. 170-179 | 2016 | Scopus |
Иванько Д., Кипяткова И.С., Ронжин А.Л., Карпов А.А. Анализ методов многомодального объединения информации для аудиовизуального распознавания речи // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2016. - Т. 16. - № 3(103). - С. 387-401 | 2016 | ВАК, РИНЦ |
Gruber I., Hlavac M., Hruz M., Zelezny M., Karpov A.A. An Analysis of Visual Faces Datasets//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9812, pp. 18-26 | 2016 | Scopus, Web of Science |
Kaya H., Karpov A.A., Ali Salah A. Robust Acoustic Emotion Recognition based on Cascaded Normalization and Extreme Learning Machines//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9719, pp. 115-123 | 2016 | Scopus, Web of Science |
Kipyatkova I., Karpov A. Language Models with RNNs for Rescoring Hypotheses of Russian ASR//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, Vol. 9719, pp. 418-425 | 2016 | Scopus, Web of Science |
Kipyatkova I.S., Karpov A.A. Recurrent neural network-based language modeling for an automatic Russian speech recognition system//Proceedings of Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference, AINL-ISMW FRUCT 2015, 2015, pp. 33-38 | 2015 | Scopus, Web of Science |
Карпов А.А. 4-й Международный семинар по речевым технологиям для малоресурсных языков SLTU-2014 [4 th International workshop on spoken language technologies for under-resourced languages] // Вопросы языкознания [Voprosy Jazykoznanija] -2015. - № 2. - С. 150-152 | 2015 | Scopus, ВАК, РИНЦ |
Карпов А.А., Верходанова В.О. Речевые технологии для малоресурсных языков мира [Speech technologies for under-resourced languages of the world] // Вопросы языкознания [Voprosy Jazykoznanija] -2015. - № 2. - С. 117-135 | 2015 | Scopus, ВАК, РИНЦ |
Karpov A.A., Ronzhin A.L., Kipyatkova I. Automatic Analysis of Speech and Acoustic Events for Ambient Assisted Living//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2015, Vol. 9176, pp. 455-463 | 2015 | Scopus, Web of Science |
Ronzhin A.L., Karpov A.A. A Software System for the Audiovisual Monitoring of an Intelligent Meeting Room in Support of Scientific and Education Activities//Pattern Recognition and Image Analysis (Advances in Mathematical Theory and Applications), 2015, Vol. 25, No. 2, pp. 237–254 | 2015 | Scopus, ВАК, РИНЦ |
Kaya H., Karpov A., Ali Salah A. Fisher Vectors with Cascaded Normalization for Paralinguistic Analysis//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, pp. 909-913 | 2015 | Scopus, Web of Science |
Kipyatkova I., Karpov A. A Comparison of RNN LM and FLM for Russian Speech Recognition//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2015, Vol. 9319, pp. 42-50 | 2015 | Scopus, Web of Science |
Lyakso E., Frolova O., Dmitrieva E., Grigorev A., Kaya H., Ali Salah A., Karpov A. EmoChildRu: Emotional Child Russian Speech Corpus//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2015, Vol. 9319, pp. 144-152 | 2015 | Scopus, Web of Science |
Karpov A., Ronzhin A. A Universal Assistive Technology with Multimodal Input and Multimedia Output Interfaces//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, Vol. 8513, pp. 369-378 | 2014 | Scopus |
Karpov A., Kipyatkova I., Zelezny M. A Framework for Recording Audio-Visual Speech Corpora with a Microphone and a High-Speed Camera//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, Vol. 8773, No. LNAI, pp. 50-57 | 2014 | Scopus, Web of Science |
Kipyatkova I., Karpov A. Study of Morphological Factors of Factored Language Models for Russian ASR//Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, Vol. 8773, No. LNAI, pp. 451-458 | 2014 | Scopus, Web of Science |
Kipyatkova I.S., Verkhodanova V.O., Karpov A. Rescoring N-Best Lists for Russian Speech Recognition using Factored Language Models//4th Workshop on Spoken Language Technologies for Under-resourced languages, SLTU 2014, 2014, pp. 81-86 | 2014 | |
Karpov A.A., Akarun L., Yalcin H., Ronzhin A., Demiroz B., Coban A., Zelezny M. Audio-Visual Signal Processing in a Multimodal Assisted Living Environment//Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2014, pp. 1023-1027 | 2014 | Scopus, Web of Science |
Карпов А.А., Zelezny M. Двуязычная многомодальная система для аудиовизуального синтеза речи и жестового языка по тексту // Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics] -2014. - № 5(93). - С. 92-98 | 2014 | ВАК, РИНЦ |
Карпов А.А. Реализация автоматической системы многомодального распознавания речи по аудио- и видеоинформации // Автоматика и телемеханика -2014. - № 12. - С. 125-138 | 2014 | ВАК, РИНЦ |
Karpov A. An Automatic Multimodal Speech Recognition System with Audio and Video Information//Automation and Remote Control, 2014, Vol. 75, No. 12, pp. 2190-2200 | 2014 | Scopus, Web of Science |