atarashi.agents.wordFrequencySimilarity module

Copyright 2018 Aman Jain (amanjain5221@gmail.com)

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 2 as published by the Free Software Foundation. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.

class atarashi.agents.wordFrequencySimilarity.WordFrequencySimilarity(licenseList, verbose=0)

Bases: atarashi.agents.atarashiAgent.AtarashiAgent

scan(filePath)

Classifies the license of a file using a histogram (word-frequency) similarity algorithm.

Parameters: filePath – Path of the input file to be scanned
Returns: Short name of the license whose word-frequency histogram has the largest intersection with that of the input file
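
The idea of picking the license with the largest word-frequency intersection can be illustrated with a minimal sketch. This is not the atarashi implementation: the helper names (word_histogram, histogram_intersection, best_match), the tokenization rule, and the plain dict of license texts are all assumptions made for illustration only.

```python
from collections import Counter
import re


def word_histogram(text):
    """Count how often each lowercase alphanumeric word occurs in text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def histogram_intersection(a, b):
    """Sum, over the words present in both histograms, the smaller count."""
    # Counter's & operator keeps the minimum count for each shared key.
    return sum((a & b).values())


def best_match(text, licenses):
    """Return the short name whose histogram overlaps most with text.

    `licenses` is a hypothetical dict mapping short name -> license text;
    it stands in for whatever license list the real agent is given.
    """
    target = word_histogram(text)
    scores = {
        name: histogram_intersection(target, word_histogram(body))
        for name, body in licenses.items()
    }
    # Raises ValueError if `licenses` is empty.
    return max(scores, key=scores.get)
```

In this sketch the score is an unnormalized overlap count, so longer license texts can accumulate larger scores; a real agent may normalize or preprocess differently.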