Stopwords files for several languages #66
Conversation
Languages included: arabic, bulgarian, catalan, czech, danish, dutch, finnish, german, hebrew, hindi, hungarian, indonesian, malaysian, italian, norwegian, polish, portuguese, romanian, russian, slovak, spanish, swedish, turkish, ukrainian and vietnamese

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.

Hi Florian! Good repo, I had a lot of fun with it :)
I have added stopwords files for some missing languages. They were extracted from here and converted using a Python script.
New languages included: arabic, bulgarian, catalan, czech, danish, dutch, finnish, german, hebrew, hindi, hungarian, indonesian, malaysian, italian, norwegian, polish, portuguese, romanian, russian, slovak, spanish, swedish, turkish, ukrainian and vietnamese