NOVA IMS Assistant: Enhancing Information Access and Campus Engagement through an Intelligent Chatbot

Bibliographic details
Title: NOVA IMS Assistant: Enhancing Information Access and Campus Engagement through an Intelligent Chatbot
Authors: Sousa, Joana Pardelha Marcelo
Contributors: Neto, Miguel de Castro Simões Ferreira; Jardim, João Bruno Morais de Sousa; RUN
Publication year: 2024
Subject terms: Chatbot, RAG, GPT, Natural Language Processing, Artificial Intelligence, SDG 8 - Decent work and economic growth, Scientific Domain/Area::Natural Sciences::Computer and Information Sciences
Description: Dissertation presented as a partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Data Science
Description (Translated): Chatbots have changed how people interact with technology, offering intuitive and efficient communication in many environments, including academic ones. This work builds on advances in natural language processing models to develop a GPT-3.5-based chatbot, enhanced with Retrieval-Augmented Generation (RAG) and tailored to the NOVA IMS community. The chatbot was built with LangChain and uses Chroma for vector storage, enabling it to provide accurate and contextually relevant responses. Two custom datasets were created to evaluate multiple aspects of the chatbot's performance, including the retriever's similarity measure, chunking strategies, and prompt templates; the evaluation combined manual review with RAGAS. Overall, the chatbot performs well, providing accurate and relevant replies within the NOVA IMS setting. Nevertheless, qualitative analysis revealed areas for improvement, such as incomplete answers and the inclusion of irrelevant information.
Contents Note: TID:203776364
File description: application/pdf
Language: English
Availability: http://hdl.handle.net/10362/174586
Rights: open access
Accession number: rcaap.com.unl.run.unl.pt.10362.174586
Database: RCAAP