this is tres bien.
same for data sets?
If you use Wikipedia as an input, for example, your data is CC-By-SA, not CC-By.