High-Level Project Summary
We developed a search tool that uses NLP and OpenSearch to index PDFs from the NASA API and search them semantically. We think it is important to improve the search box of the report server, both to optimize searches by NASA engineers and to help other curious people access the data more easily.
Link to Final Project
Link to Project "Demo"
Detailed Project Description
We wrote a Python script that uses NLP to extract keywords from the PDFs and stores them in a NoSQL database. Once this data is stored, a user can type a sentence into the search box. The web application extracts keywords from that sentence and compares them with the keywords in the database. If there is a match, the application shows an item on the page that links to the corresponding PDF.
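The flow above can be illustrated with a minimal sketch. It is not the project's actual code: a simple stopword filter stands in for the NLP keyword extraction, an in-memory dictionary stands in for the NoSQL database, and the function names, file names, and sample texts are all hypothetical.

```python
import re

# Tiny stopword list for illustration only (assumption); the project relied
# on NLP libraries for keyword extraction instead of a hand-written list.
STOPWORDS = {"a", "an", "and", "the", "of", "in", "on", "for", "to",
             "is", "are", "with", "about", "how", "what"}


def extract_keywords(text):
    """Return the set of lowercase tokens that are not stopwords."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return {t for t in tokens if t not in STOPWORDS and len(t) > 2}


def build_index(documents):
    """Map each document id to its keyword set (stand-in for the database)."""
    return {doc_id: extract_keywords(text) for doc_id, text in documents.items()}


def search(index, query):
    """Return document ids that share keywords with the query, best match first."""
    query_keywords = extract_keywords(query)
    hits = []
    for doc_id, keywords in index.items():
        shared = len(query_keywords & keywords)
        if shared > 0:
            hits.append((shared, doc_id))
    # Sort by number of shared keywords (descending), then by id for stability.
    hits.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in hits]


# Hypothetical sample data: text that would have been extracted from two PDFs.
documents = {
    "report-001.pdf": "Thermal protection materials for reentry vehicles",
    "report-002.pdf": "Propulsion system testing for lunar landers",
}
index = build_index(documents)
results = search(index, "What materials protect vehicles during reentry?")
print(results)
```

Here the query shares keywords only with the first sample document, so that is the single result the web page would link to. Matching on exact shared keywords is the simplest possible comparison. A search engine such as OpenSearch would add relevance scoring on top of it.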
Space Agency Data
We used the NASA Technical Reports Server (NTRS) search: Search - NASA Technical Reports Server (NTRS)
We used the NASA NTRS API: Swagger UI (nasa.gov)
Hackathon Journey
It was a great experience in which we learned a lot about several new technologies and about data management. We all had a great time together.
References
PDFMiner: pdfminer (PyPI)
OpenSearch: OpenSearch
KeyBERT: keybert (PyPI)
Docker: Docker (home page)
Python: Python.org (downloads page)
Python NLTK: NLTK, Natural Language Toolkit
Flask: Flask Documentation 2.2.x (palletsprojects.com)
React: React, a JavaScript library for building user interfaces (reactjs.org)
Bootstrap: Bootstrap, the most popular HTML, CSS, and JS library in the world (getbootstrap.com)
Tags
#data #AI

