atarashi.libs.initialmatch module

Copyright 2018 Aman Jain (amanjain5221@gmail.com)

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 2 as published by the Free Software Foundation. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.

atarashi.libs.initialmatch.HeadersNgramSim(header, processedData)

Creates an array of n-grams from the license header and checks how many of them match the processed data. The similarity score is computed as sim_score = matches / count of ngrams.

Parameters:
  • header – License Header
  • processedData – Input file extracted and processed data
Returns:

Array of JSON with scanning results
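
The similarity score described for HeadersNgramSim can be sketched as follows. This is a minimal illustration, not the atarashi implementation: the helper names word_ngrams and ngram_sim_score, the whitespace tokenization, and the default n-gram size of 3 are all assumptions made for the example.

```python
def word_ngrams(text, n=3):
    """Return the word n-grams of `text` (whitespace tokenization is an assumption)."""
    tokens = text.split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_sim_score(header, processed_data, n=3):
    """Hypothetical helper: sim_score = matches / count of ngrams."""
    header_ngrams = word_ngrams(header, n)
    if not header_ngrams:
        return 0.0  # header has fewer than n words, so there is nothing to compare
    # Build the set of n-grams found in the processed input data.
    data_ngrams = set(word_ngrams(processed_data, n))
    # Count the header n-grams that also occur in the processed data.
    matches = sum(1 for gram in header_ngrams if gram in data_ngrams)
    return matches / len(header_ngrams)
```

A header whose n-grams all occur in the processed data scores 1.0, and a header sharing none scores 0.0.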

atarashi.libs.initialmatch.initial_match(filePath, processedData, licenses)
Parameters:
  • filePath – Input file path
  • processedData – Input file extracted and processed data
  • licenses – Processed License List
Returns:

Array of JSON with scanning results from spdx_identifer and HeadersNgramSim

atarashi.libs.initialmatch.spdx_identifer(data, shortnames)

Identifies SPDX-License-Identifier tags in the input data. An identifier is reported only if it is present in the FOSSology merged license list.

Parameters:
  • data – Input File data
  • shortnames – Array of shortnames (SPDX-ID)
Returns:

Array of JSON with scanning results
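
The identifier check described for spdx_identifer can be sketched as follows. This is a rough illustration under stated assumptions rather than the module's actual code: the function name find_spdx_ids, the regular expression, and the result key "shortname" are hypothetical, and compound expressions such as "MIT OR Apache-2.0" are not handled beyond their first identifier.

```python
import re

# Matches tags such as "SPDX-License-Identifier: GPL-2.0-only".
# Only the first identifier after the tag is captured (assumption for this sketch).
SPDX_TAG = re.compile(r"SPDX-License-Identifier:\s*([A-Za-z0-9.+-]+)")


def find_spdx_ids(data, shortnames):
    """Hypothetical helper: return identifiers found in `data` that are in `shortnames`."""
    known = set(shortnames)
    results = []
    for candidate in SPDX_TAG.findall(data):
        # Keep only identifiers present in the known license list.
        if candidate in known:
            results.append({"shortname": candidate})  # result shape is an assumption
    return results
```

Identifiers that do not appear in shortnames are dropped, which mirrors the requirement that a reported identifier be part of the merged license list.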