Shruti Palaskar
Title
Cited by
Cited by
Year
How2: a large-scale dataset for multimodal language understanding
R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...
arXiv preprint arXiv:1811.00347, 2018
912018
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
O Scharenborg, L Besacier, A Black, M Hasegawa-Johnson, F Metze, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
31*2018
Combining LSTM and latent topic modeling for mortality prediction
Y Jo, L Lee, S Palaskar
arXiv preprint arXiv:1709.02842, 2017
242017
Multimodal abstractive summarization for how2 videos
S Palaskar, J Libovický, S Gella, F Metze
arXiv preprint arXiv:1906.07901, 2019
212019
End-to-end multimodal speech recognition
S Palaskar, R Sanabria, F Metze
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
212018
Cmu sinbad’s submission for the dstc7 avsd challenge
R Sanabria, S Palaskar, F Metze
DSTC7 at AAAI2019 workshop 6, 2019
202019
Building an asr system for a low-resource language through the adaptation of a high-resource language asr system: Preliminary results
O Scharenborg, F Ciannella, S Palaskar, A Black, F Metze, L Ondel, ...
Proceedings of ICNLSSP, Casablanca, Morocco, 2017
182017
Acoustic-to-word recognition with sequence-to-sequence models
S Palaskar, F Metze
2018 IEEE Spoken Language Technology Workshop (SLT), 397-404, 2018
172018
ASR error correction and domain adaptation using machine translation
A Mani, S Palaskar, NV Meripo, S Konam, F Metze
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
122020
Multimodal grounding for sequence-to-sequence speech recognition
O Caglayan, R Sanabria, S Palaskar, L Barraul, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
122019
Learned in speech recognition: Contextual acoustic word embeddings
S Palaskar, V Raunak, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
102019
Multimodal abstractive summarization of open-domain videos
J Libovický, S Palaskar, S Gella, F Metze
Proceedings of the Workshop on Visually Grounded Interaction and Language …, 2018
102018
Learning from multiview correlations in open-domain videos
N Holzenberger, S Palaskar, P Madhyastha, F Metze, R Arora
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
92019
Towards understanding ASR error correction for medical conversations
A Mani, S Palaskar, S Konam
Proceedings of the First Workshop on Natural Language Processing for Medical …, 2020
42020
How2Sign: a large-scale multimodal dataset for continuous American sign language
A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ...
arXiv preprint arXiv:2008.08143, 2020
32020
Grounded Sequence to Sequence Transduction
L Specia, L Barrault, O Caglayan, A Duarte, D Elliott, S Gella, ...
IEEE Journal of Selected Topics in Signal Processing 14 (3), 577-591, 2020
12020
Transfer learning for multimodal dialog
S Palaskar, R Sanabria, F Metze
Computer Speech & Language 64, 101093, 2020
2020
ASR ERROR CORRECTION AND DOMAIN ADAPTATION USING MACHINE TRANSLATION Download PDF
A Mani, S Palaskar, NV Meripo, S Konam, F Metze
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis
EH PI, T Berg-Kirkpatrick, J Carbonell, H Chalupsky, A Gershman, ...
The system can't perform the operation now. Try again later.
Articles 1–19