stopwords-iso/stopwords-collator

Stopwords Collator

Organizes raw stopword lists into a JSON file and a text file. It performs the following steps:

  • Convert to lowercase
  • Strip carriage returns (CRs)
  • Strip byte order marks (BOMs)
  • Remove duplicates
  • Strip whitespace
  • Split tab-separated words
  • Sort the final result
  • Ignore lines starting with #
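
The steps above can be sketched as a single function. This is a minimal illustration, not the package's actual implementation: the name `collate` is hypothetical, the order of the steps is an assumption, and the default JavaScript sort (by UTF-16 code unit) stands in for whatever ordering the tool really uses.

```javascript
// Hypothetical helper illustrating the listed steps; not the package's real code.
function collate(rawText) {
  const words = rawText
    .replace(/\uFEFF/g, '')                   // strip BOMs
    .replace(/\r/g, '')                       // strip CRs
    .split('\n')
    .filter((line) => !line.startsWith('#'))  // ignore lines starting with #
    .flatMap((line) => line.split('\t'))      // split tab-separated words
    .map((word) => word.trim().toLowerCase()) // strip whitespace, lowercase
    .filter((word) => word.length > 0);       // drop empty entries

  // Remove duplicates, then sort (assumed: default code-unit ordering).
  return [...new Set(words)].sort();
}

const sample = '\uFEFF# comment\r\nThe\tA\r\n  the \r\nan\r\n';
console.log(collate(sample)); // [ 'a', 'an', 'the' ]
```

Running the steps in one chain keeps each rule visible on its own line, so a rule can be reordered or removed without touching the others.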

Command Line

The following command line options are supported:

  • -s <path> - Directory containing the stopwords text files (.txt)
  • -o <path> - Filename and path of the output file

$ stopwords-collator -s raw -o lib/stopwords
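
For illustration only, the two documented options could be read from an argument array as follows. The function name `parseOptions` and the returned property names are hypothetical; the tool's real argument handling is not shown in this README and may differ.

```javascript
// Hypothetical sketch: read the documented -s and -o options from an argv-style array.
function parseOptions(argv) {
  const options = { source: null, output: null };
  for (let i = 0; i < argv.length; i += 1) {
    if (argv[i] === '-s') {
      options.source = argv[++i] ?? null; // directory of raw .txt files
    } else if (argv[i] === '-o') {
      options.output = argv[++i] ?? null; // path of the output file
    }
  }
  return options;
}

console.log(parseOptions(['-s', 'raw', '-o', 'lib/stopwords']));
// { source: 'raw', output: 'lib/stopwords' }
```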

Contributing

  • Maintain 100% test coverage and introduce no ESLint issues.
  • Open an issue first for significant changes.