Could including my content as part of a corpus to generate a machine learning algorithm be considers a form of transformation? If so, CC BY-SA implies people can still do that, but my content must be credited?