Differences

conversational_speech_translation [2020/02/03 20:51]
esalesky - duplicating important dates section from main
conversational_speech_translation [2020/02/05 20:38] (current)
esalesky -- fisher mapping code clarification
Line 53: Line 53:
 Participants should sign the license agreement and follow the directions in the pdf to return a signed copy to LDC (by [[ldc@ldc.upenn.edu|email]] or fax). Once received, LDC will provide a download link for the data package within 1-2 days. Participants who do not already have an LDC account will need to create one to download the data; the LDC membership office will assist with any questions.
  
-To enable immediate participation, we provide preprocessed speech features, with mapped (parallel) speech and text (transcript and translation) utterances.
-We additionally provide the original LDC data releases for those who wish to extract their own features etc.
-We note that the original speech and translations require a mapping step to be made parallel, and we provide [[https://github.com/esalesky/fisher-mapping|code]] to do so within the data package (further details in the data package README).
+To enable immediate participation, we provide preprocessed speech features, with mapped (parallel) speech and text (transcript and translation) utterances in the IWSLT data package.
+For those who instead wish to extract their own features etc., we additionally provide the original LDC data packages.
+We note that the original speech and translations require a mapping step to be made parallel, and we provide [[https://github.com/esalesky/fisher-mapping|code]] to do so within the data package (further details in the data package README). This is only necessary if you wish to extract your own features
  
 Participants who wish to use additional data beyond what is provided (**unconstrained**) must also submit systems which use only the data provided (**constrained**); constrained and unconstrained systems will be scored separately.