Awards & Nominations
SoloMan has received the following awards and nominations. Way to go!
SoloMan has received the following awards and nominations. Way to go!
The NASA Technical Report Server (NTRS) includes hundreds of thousands of items containing scientific and technical information (STI) funded by NASA. To enable searches of this large NTRS database, this project involves Ai development that can read a collection of PDF files, summarize those files, produce statistical reports of the language usage, and list topic keywords. In this project, several techniques will be used including machine learning modelling and statistical analysis to produce the desired report summary of each document. By doing so, future researchers would be able to use this information to find desired historical data quickly.

NASA NLP Analytics is an Ai SOFTWARE that carries out natural language processing to analyse the scientific report in order to produce statistical reports of language used and content prediction. In this case, This software will automatedly analyse each document included inside a corpus prepared. The software developed not only can integrate with the NASA website but also with third-party software in producing an automated application. The project demo will display the 56 technical reports retrieved from NASA Technical Report Server and the report analysis produced by our software to enable comparison.
***Please download the Result report and zoom in to get a clearer image regarding the words displayed on Bi-gram and Trigram Graph
In short, the AI application developed can be divided into several steps, including text extraction, keyword analysis, text summarization, content prediction and pdf production. In this area, I will explain in detail the 3 main features of the software :
We believe that this software will be able to retrieve valuable reports from the report analysis so that the data collected can be applied to help researchers in accessing the right information that they want. Accessing all those thousand and hundreds of documents would be trouble, especially for users of the NASA website. The readily developed software will help to overcome this problem. With the information analysed, it can not only be further developed to build a query system or even a recommendation system for users to access reports. We hope that this software can change the situation and save users time in reading the reports with thousands of words
(Link: https://ntrs.nasa.gov/search?center=CDMS)
All the data was retrieved with the purpose of building a corpus of report samples. The following Drive consists of all the 56 data reports downloaded from the website:
https://drive.google.com/drive/folders/1jN5qZb-rkgrGGdoofbK2emVZ4WEQEQse?usp=sharing
This hackathon is really an opportunity for me to apply my skill and knowledge to a potential project. Natural Language Processing is really things, especially in the financial sector and a lot more. Having my previous experience in a past hackathon (Banking industries hackathon), I get to know how real-world data scientists would do in building and leveraging technology in text analytics. Those experiences really help me to make up my mind in participating in this hackathon to apply my knowledge. As a solo player this time, it's quite challenging for me in terms of time management. Few days prior to the hackathon, I even started to search for related information and screened through the problem statements. So I started my preparartion early and carry out the developement step by step from data cleaning, testing and debugging. In short, it is a really special experience for me and I enjoy the most!
#NLP#AI#DATA#ANALYTICS#ML#STATISTICAL#SOLO
The NASA Technical Report Server (NTRS) includes hundreds of thousands of items containing scientific and technical information (STI) created or funded by NASA. Imagine how difficult it can be to locate desired information in such a large repository! Your challenge is to develop a technique using Artificial Intelligence (AI) to improve the accessibility and discoverability of records in the public NTRS.

