As far as Fedora is concerned, the licensing guidelines are very clear and
detailed on this matter. Unless all the dependencies are free and open
source, it has no chance of going in. This is regardless of whether such an
approach is legal or not.
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 17:32 UTC (Mon) by bedahr (guest, #60420)
The HTK is, strictly speaking, not a dependency of simon. It extends simon's functionality: without it, it is not possible to create speech models, but you can still use existing ones.
So I would much rather compare it to firefox / flash. Just because firefox functionality can be extended using flash, firefox itself is not non-free...?
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 17:50 UTC (Mon) by jspaleta (subscriber, #50639)
Relying on code that can only be generated with a proprietary tool is touchy.
Are there existing speech models which can be shipped under an appropriate license? Are speech models binary blobs akin to byte-compiled Java or Python code (i.e. not allowed in Fedora)? Or are the speech models themselves self-documenting scripts that are then interpreted at run time by Simon? Could I realistically write or edit a speech model manually? If Fedora did choose to ship a pre-existing speech model generated by the proprietary tool and there was a bug found in the speech model, what steps would the Fedora maintainer of Simon need to take to fix the problem?
If Simon needs at least one speech model locally to be useful...we'd have to understand what the speech models are in terms of codebase and the implications thereof.
-jef
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 18:11 UTC (Mon) by dlang (✭ supporter ✭, #313)
the GPL does not require that the entire toolchain be free
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 18:50 UTC (Mon) by jspaleta (subscriber, #50639)
I did not say that it did. Fedora's policy is more nuanced than what is strictly allowed by the GPL. If individual users want to use the proprietary toolchain, they are free to, but at the project level Fedora puts a heavy emphasis on an open toolchain for contributors to use in testing and maintenance. I can use Intel's icc compiler on my own systems for its parallelization support...Fedora's not going to stop me...but I can't be expected to submit code to Fedora packages which requires icc-specific features to be useful. If a dependency has to be built with a proprietary toolchain, its inclusion would be exceptional and would require significant discussion, methinks.
The devil's in the details. I think a lot of people would need to study up on the details of this codebase's interaction with proprietary bits; this is not a fire-and-forget situation by any means. The point is, this isn't a common situation in terms of licensing a functional software stack, nor is it ideal. It is outside standing policy and common practice.
-jef
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 18:47 UTC (Mon) by bedahr (guest, #60420)
Speech models are not code. Think of them as documents (in this metaphor simon is a document editor).
Of course there are existing speech models.
You could even use speech models created by SPHINX-Train by using a speech model converter to convert the model to HTK format (there is such a converter available on sourceforge).
BUT: Speech models created by the HTK can be used _freely_ anyway. You can create models using HTK and then basically use them for whatever you want. This is also the reason why the voxforge initiative can build their speech model using the HTK and still licence the model itself under the GPL.
I don't know what you mean by "bug in a speech model" but I am going to assume that you mean e.g. wrongly transcribed training samples. Well, fixing that would depend on how you built the model in the first place. In all likelihood you would end up changing the input files and re-generating the whole model with those new parameters (using the HTK, SPHINX or whatever was used in the first place).
For the record: There is an open source initiative called ghmm which tries to create a GPL-licensed library for working with HMMs. I contacted them, though, and they said they were not ready for this kind of usage and generally want to be more general-purpose than the HTK, so I am not sure if they will be ready soon, or ever.
Also, the HTK is very high quality software and a good recognition rate is obviously the main goal for any speech recognition software - GPL or not.
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 18:56 UTC (Mon) by jspaleta (subscriber, #50639)
great!
...document format..not compiled code.
...open tool to convert other formats into that format.
...other formats creatable by open codebase.
This should be a non-issue if this comes up for discussion in a package review.
-jef
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 19:01 UTC (Mon) by bedahr (guest, #60420)
Thanks for actually _discussing_ this!
I can't remember how often I had the exact same issue raised, but it always ended in someone crying out: "Uses non-GPL code! Kill it with fire!" (or similar) and not responding to any replies or explanations from my side at all.
So again, thanks for understanding the complicated situation!
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 19:46 UTC (Mon) by jspaleta (subscriber, #50639)
Make sure you are able to make the "speech model as document format" argument clear when someone steps up to submit the package. You might want to drop a blurb in a high-level readme in the simon codebase which speaks to this (if it's not there already). When/if this comes up for submission as a Fedora package, there's no guarantee the reviewers will have read the discussion here...but they will review the material in the simon codebase in discussion with the packager. Dropping a note into a readme will help make reviewers aware that speech models are editable text file content, and it should note at a minimum the existence of sphinx-train and the speech model format converter tool.
-jef
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 20:31 UTC (Mon) by bedahr (guest, #60420)
Yes I will add this information tomorrow.
Maybe I'll even add it to the FAQ of the project wiki...
But btw.: Has anyone even talked to the Fedora team? Or is this a hypothetical discussion? If so, it is oddly Fedora-specific IMHO.
Greetings,
Peter
Simon - speech activated user interface for KDE (KDE.News)
Posted Aug 24, 2009 20:47 UTC (Mon) by jspaleta (subscriber, #50639)
This is somewhat hypothetical.... someone has to do the packaging work and submit it for review... and I'm not aware of anyone working on packaging Simon yet for Fedora. Hell, this is the first I've heard of it. I'm holding out for direct neural interfaces instead of speech...moving my mouth takes soooo much effort.
I'll bet you dollars to doughnuts members of Fedora's technical leadership will read the discussion here and will be aware of the content argument. But ultimately it comes down to someone taking the responsibility to maintain the Simon package and start the package submission review process. A summary of the situation in a FAQ or readme will help prevent an unnecessary delay once someone does step forward.
I would think a Debian packaging effort would also benefit from a summary of this discussion...if they aren't already working on packages. I think they'll have similar concerns, but I'm less informed about the details of Debian policy with regard to "content" versus "code" than I am about Fedora's policy.