Publications

(2024). Denoising Labeled Data for Comment Moderation Using Active Learning. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING).

(2024). Power and Vulnerability: Managing Sensitive Language in Organisational Communication. Frontiers in Psychology.

(2024). Making the Pick: Understanding Professional Editor Comment Curation in Online News. 18th International AAAI Conference on Web and Social Media (ICWSM), 2024.

(2023). Tracing Linguistic Markers of Influence in a Large Online Organisation. Association for Computational Linguistics: ACL 2023.

(2023). LEDA: a Large-Organization Email-Based Decision-Dialogue-Act Analysis Dataset. Findings of the Association for Computational Linguistics: ACL 2023.

(2023). Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis. IEEE Access (Volume 11).

(2023). Power and Vulnerability: Managing Sensitive Language in Organisational Communication. 33rd Annual Meeting of the Society for Text and Discourse (ST&D 2023).

(2023). Neural Machine Translation for Low-resource Languages: A Survey. ACM Computing Surveys.

(2022). CoRAL: a Context-aware Croatian Abusive Language Dataset. Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022.

(2022). CWID-hi: A Dataset for Complex Word Identification in Hindi Text. Proceedings of the Thirteenth Language Resources and Evaluation Conference.

(2021). Not All Comments are Equal: Insights into Comment Moderation from a Topic-Aware Model. 13th biennial International Conference on Recent Advances in Natural Language Processing (RANLP).

(2021). Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection. Proceedings of the Hackashop on News Media Content Analysis and Automated Report Generation (EACL).

(2021). EMBEDDIA Tools, Datasets and Challenges: Resources and Hackathon Contributions. Proceedings of the Hackashop on News Media Content Analysis and Automated Report Generation (EACL).

(2019). The Devil is in the Detail: A Magnifying Glass for the GuessWhich Visual Dialogue Game. Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SemDial) - Full Papers.

(2019). Evaluating the Representational Hub of Language and Vision Models. Proceedings of the 13th International Conference on Computational Semantics (IWCS) - Long Papers.

(2019). Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Volume 1 (Long and Short Papers).

(2018). Learning to see from experience: But which experience is more propaedeutic? Shortcomings in Vision and Language, ECCV (SiVL@ECCV).

(2017). Vision and Language Integration: Moving beyond Objects. 12th International Conference on Computational Semantics (IWCS) - Short Papers.

(2017). FOIL it! Find One mismatch between Image and Language caption. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL) (Volume 1: Long Papers).

(2013). Document specific sparse coding for word retrieval. 12th International Conference on Document Analysis and Recognition (ICDAR).

(2012). Content level access to Digital Library of India pages. Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP).

(2012). Word image retrieval using bag of visual words. 10th IAPR International Workshop on Document Analysis Systems (DAS).
