Hi Rémi,<div><br></div><div>Thank you so much for replying. I'll try to address your points as best I can. </div><div><br></div><div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">


<div class="im">Voice can be synthesized from plain text directly, and from rich text at</div>

the cost of losing the "enrichment". But to read bitmaps, you would need<br>

optical character recognition. If that is required (especially for DVD<br>

subtitles playback), there may be a very hard problem. As for "burnt"<br>

subtitles, ouch, I guess OCR would be very hard and unreliable.<br>

<br></blockquote><div><br></div><div>We have already decided against reading bitmap or burnt-in subtitles, because that would require much more work and would probably still be unreliable. Our idea is to let the user download subtitles from the web as plain text in *.srt format. The video files, though, will come from DVDs in the evaluation study we will be running, rather than being downloaded. </div>


<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

Then you need a speech synthesis engine. There are quite a few open-source<br>

ones. But I don't know which ones, if any, support Swedish phonetics and<br>

have a GPL-compatible copyright license. Did you already sort out that part<br>

of the equation?<br></blockquote><div><br></div><div>Since we work with users who will probably already have a license for a speech synthesis engine, we were hoping that the module could support a few of the most common engines, both commercially licensed and free, and include a function for checking which one is installed on the user's system (or, alternatively, let the user state this). Will mixing free and licensed engines be a problem? Off the top of my head, I can think of espeak (not very good) and Festival as engines supporting Swedish phonetics, but I am sure there are more. The basic idea is that the engine wouldn't ship with the module; the module would work with the one already in place on the user's system (if applicable). </div>


<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

<br>

And last, you need to filter out the original voice from the original<br>

audio track. Or do you not mind losing the original audio sound effects? I<br>

don't suppose you will always have a clear/speech-less audio channel<br>

available in the original media.<br></blockquote><div><br></div><div>We were hoping to be able to preserve the original sound, and for evaluating the finished module we are interested in letting users set their own volume preferences for the different channels. Do you think this would be possible? If not, we'd like it filtered out but not muted.</div>


<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

<div class="im">It depends a lot on the requirements, and what existing components could</div>

be sourced or would be provided by you.<br></blockquote><div><br></div><div>I would welcome more questions or pointers that would help us understand what resources we need and who could provide them. Given these premises, do you have a clearer picture of how much effort we could be talking about? </div>


<div><br></div><div>All the best,</div><div><br></div><div>Sandra</div></div></div>