<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" >
<channel>
	<title><![CDATA[MultiLing Community Site: Task: MMS - Multi-document Summarization - Data and information]]></title>
	<link>http://multiling.iit.demokritos.gr/pages/view/1540/task-mms-multi-document-summarization-data-and-information</link>
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">http://multiling.iit.demokritos.gr/pages/view/1540/task-mms-multi-document-summarization-data-and-information</guid>
	<pubDate>Sat, 17 Jan 2015 12:30:43 +0200</pubDate>
	<link>http://multiling.iit.demokritos.gr/pages/view/1540/task-mms-multi-document-summarization-data-and-information</link>
	<title><![CDATA[Task: MMS - Multi-document Summarization - Data and information]]></title>
	<description><![CDATA[<div><p><strong>Test data</strong></p><p><span style="text-decoration: underline;">ATTENTION:</span> Before you can download please contact ggianna AT iit DOT demokritos DOT gr to get you team's username and password.</p><p>Test data downloadable from this <a href="http://users.iit.demokritos.gr/~ggianna/MultiLing2015/MultiDocument/SourceTextsV2b-20150214.zip" title="Test data link">link</a>.</p><p>We note that 3 of the test data topics (M001,M002,M003) will not be taken into account during evaluation, since they have been used as training data (see below).</p><p><strong>Training data - ** NEW **</strong></p><p>Training data downloadable from this <a href="http://users.iit.demokritos.gr/~ggianna/MultiLing2015/MultiDocument/TrainingData2015-20150314.zip" title="Training data link">link</a>, using the participant username and password provided via e-mail.</p><p><strong>Task overview&nbsp;</strong></p>
<p>This MultiLing task aims to evaluate the application of (partially or fully) language-independent summarization algorithms on a variety of languages. Each system participating in the task will be called to provide summaries for a range of different languages, based on corresponding corpora.&nbsp; In the MultiLing Pilot of 2011 the languages used were 7, while in the MultiLing 2015 8 languages will be used. Participating systems will be required to apply their methods on a minimum of two languages. Evaluation will favour systems that apply their methods in more languages.</p>
<p>The MultiLing task requires to generate a single, fluent, representative summary from a set of documents describing an event sequence. The language of the document set will be within a given range of languages and all documents in a set share the same language. The output summary should be of the same language as its source documents. The output summary should be 250 words (for non-Chinese languages) or 750 bytes (for Chinese language, in UTF-8 encoding) at most.</p>
<p><strong>Sample input and output</strong></p>
<p>The input and output data samples are based on the MultiLing 2013 equivalent task.</p><p><a href="http://multiling.iit.demokritos.gr/file/view/1250/multiling-2013-sample-inputs-for-multilingual-multi-document-summarization">Sample input files</a> for several languages (UTF-8 encoded, plain text files). Unzip the provided file to see the&nbsp;sample input files.</p><p><a href="http://multiling.iit.demokritos.gr/file/view/1251/multiling-2013-sample-outputs-for-multilingual-multi-document-summarization">Sample output files</a> for several languages (UTF-8 encoded, plain text files, 250 words max for non-Chinese languages or 750 bytes max for Chinese language). Unzip the provided file to see the sample output files.</p><p><em>References:</em></p><ul>
<li>Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J., and Varma, V. (2011). TAC2011 MultiLing Pilot Overview.</li>
<li id="gs_cit0">Giannakopoulos, George. "Multi-document multilingual summarization and evaluation tracks in ACL 2013 MultiLing Workshop." <em>MultiLing 2013</em> (2013): 20.</li>
</ul><div>&nbsp;</div></div>]]></description>
	<dc:creator>George Giannakopoulos (Admin)</dc:creator>
</item>

</channel>
</rss>