Coding Week-5 Meeting
Attendees
- Gaurav Mishra
- Anupam Ghosh
- Michael C. Jaeger
- Shaheem Azmal
- Ayush Bhardwaj
- Vasudev Maduri
- Omar Mohamed
- Kaushlendra Pratap
- Shreya Singh
Discussions
- Multiprocessing implementation to all the scripts to make the process fast.
- As in last week it was discussed to apply the generation part to all the licenses currently in fossology database, this part was done and the results of which were discussed.
- Research and implement other NLP algorithms that can be used either in data validation or generation.
Week-5 Progress
- Extracted license headers and their text of licenses present in Fossology database from JSON
- Implemented Script to download the licenses.
- Created Minerva-Dataset repo, and pushed all my progress so far in dataset generation and validation of licenses.
Conclusion and Further Plans
To discuss the nomos validated results with the mentors and proceed with Augly implementation.