MediKnow - A Malayalam Cancer Question Answering System
Abstract
This paper introduces "MediKnow," a Malayalam question answering system for cancer-related queries, designed to address the scarcity of generative question answering work in healthcare information access. The lack of such systems for Dravidian languages, particularly Malayalam, motivated its development. MediKnow combines Natural Language Processing (NLP) techniques, including OpenAI models and FAISS for efficient vector storage, with a specialized Malayalam language model to handle the intricacies of the Dravidian linguistic context. Its processing pipeline comprises document loading, text splitting, and embedding generation, which together improve the system's capacity to comprehend and accurately respond to a diverse range of cancer-related questions. The work underscores the need to close the gap in generative question answering for Dravidian languages and highlights the specific challenges posed by the complexity of Malayalam. Beyond providing accessible information, MediKnow illustrates how state-of-the-art NLP technologies can be applied to address linguistic nuances. The paper evaluates the system on a dataset of cancer-related questions and reports that it delivers accurate and informative answers. The approach contributes to advancing NLP capabilities in non-English languages, with a focus on healthcare information retrieval. The development and deployment of MediKnow mark a step toward tackling linguistic and domain-specific challenges in cancer-related question answering, ultimately making critical healthcare information more accessible to Malayalam speakers.
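The pipeline named in the abstract (document loading, text splitting, embeddings, vector storage) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation. The function and class names (`split_text`, `toy_embed`, `FlatIndex`), the chunk sizes, and the sample passages are all hypothetical. The hashed character n-gram vector is only a stand-in for a learned embedding model such as the OpenAI models the abstract mentions, and the brute-force search is only a conceptual stand-in for a FAISS index.

```python
import hashlib
import math


def split_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows (hypothetical sizes)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size].strip()
        if chunk:
            chunks.append(chunk)
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


def toy_embed(text, dim=256, n=3):
    """Hash character n-grams into an L2-normalised vector.

    ASSUMPTION: a toy stand-in for a learned embedding model. It works on
    any Unicode text (including Malayalam) but captures no semantics.
    """
    vec = [0.0] * dim
    text = text.lower()
    for i in range(max(len(text) - n + 1, 1)):
        gram = text[i:i + n]
        digest = hashlib.sha256(gram.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


class FlatIndex:
    """Brute-force inner-product search over stored chunk vectors.

    ASSUMPTION: a conceptual stand-in for a vector store such as FAISS.
    """

    def __init__(self):
        self.vectors = []
        self.chunks = []

    def add(self, chunk):
        self.vectors.append(toy_embed(chunk))
        self.chunks.append(chunk)

    def search(self, query, k=1):
        """Return the k (score, chunk) pairs most similar to the query."""
        q = toy_embed(query)
        scored = [
            (sum(a * b for a, b in zip(q, v)), c)
            for v, c in zip(self.vectors, self.chunks)
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


if __name__ == "__main__":
    # Placeholder passages; a real system would load curated documents.
    documents = [
        "Placeholder passage one about a first topic in the corpus.",
        "Placeholder passage two about a different topic in the corpus.",
    ]
    index = FlatIndex()
    for doc in documents:
        for chunk in split_text(doc, chunk_size=40, overlap=10):
            index.add(chunk)
    for score, chunk in index.search("a different topic", k=2):
        print(round(score, 3), chunk)
```

In a full system of the kind the abstract describes, the retrieved chunks would then be passed to a generative model as context for composing the answer. That step is omitted here because the abstract does not specify how it is done.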
Keywords:
Natural Language Processing, Question Answering System, Dravidian Languages, Cancer Information, OpenAI, FAISS
License
Copyright (c) 2024 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
