The broad question is: how are the licensing terms for a classifier that is trained with open annotated data (or manually annotated) data?
I am trying to train a dependency parser for German text with annotated data which is licensed under creative commons license (Attribution CC BY). To train the classifier I want to use machine learning tool which is licensed under the Apache license.
Is it legally permissible to license the resulting classifier (my code and the model file) under a commercial license?
- Suppose I scrape a text from the web, or alternatively I download a corpus collection that is licensed under Attribution CC BY, and I use an annotation tool which is open source under Apache license, and I train a classifier with an Apache machine learning software.
Will it legally be permissible to license the resulting classifier under commercial terms?