<?xml version='1.0'?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:georss="http://www.georss.org/georss" >
<channel>
	<title><![CDATA[MultiLing Community Site: Task: OnForumS - Data and information]]></title>
	<link>http://multiling.iit.demokritos.gr/pages/view/1531/task-onforums-data-and-information</link>
	<description><![CDATA[]]></description>
	
	<item>
	<guid isPermaLink="true">http://multiling.iit.demokritos.gr/pages/view/1531/task-onforums-data-and-information</guid>
	<pubDate>Wed, 07 Jan 2015 10:28:13 +0200</pubDate>
	<link>http://multiling.iit.demokritos.gr/pages/view/1531/task-onforums-data-and-information</link>
	<title><![CDATA[Task: OnForumS - Data and information]]></title>
	<description><![CDATA[<p><strong><strong><br /></strong></strong></p><p><strong><strong>OnForumS System Reports</strong></strong></p><ul>
<li><a href="http://multiling.iit.demokritos.gr/file/view/1613/onforums-overview" title="OnForumS Overview"><strong>OnForumS</strong></a>: A Shared Task on On-line Forum Summarisation</li>
<li><a href="http://multiling.iit.demokritos.gr/file/download/1575" title="CIST System Report"><strong>CIST</strong></a>: CIST System Report for SIGdial MultiLing 2015</li>
<li><a href="http://multiling.iit.demokritos.gr/file/download/1576" title="JRC System Report"><strong>JRC</strong></a>:&nbsp;Tackling the OnForumS Challenge</li>
<li><a href="http://multiling.iit.demokritos.gr/file/download/1577" title="USFD_UNITN System Report"><strong>USFD_UNITN</strong></a>:&nbsp;<span>Sheffield-Trento System for Sentiment and Argument Structure Enhanced Comment-to-Article Linking in the Online News Domain</span></li>
<li><span><a href="http://multiling.iit.demokritos.gr/file/download/1578" title="UWB System Report"><strong>UWB</strong></a>:&nbsp;UWB Participation in the Multiling&rsquo;s OnForumS Task</span></li>
</ul><p><strong><br /></strong></p><p><strong>OnForumS Gold Data Set</strong></p><p><strong><br /></strong>A gold data set out of the test data set and the input from the crowdsourcing evaluation has been compiled and released. Please get in touch with the organisers if you would like to have a copy of the data set.</p><p>&nbsp;</p><p><strong>OnForumS Evaluation (including P/R/F1 measures per link-label)</strong></p><p>The evaluation spread sheets have been updated to include Precision, Recall and F1 &nbsp;measures&nbsp;<span>for every link-label per system run,&nbsp;</span><span>macro-averaged over the full set of documents. Please get in touch with the organisers if you want to have a copy of the spreadsheets.</span></p><p>&nbsp;</p><p><strong>OnForumS Submission package<br /></strong></p><p>In order to validate your submission, please use the following software package: &nbsp;<a href="http://multiling.iit.demokritos.gr/file/view/1564/onforums-submission-validation-updated" title="OnForumS Submission Validation">onforums-submission-validation-0.2</a>&nbsp;(click the button 'Download this' at the top right corner&nbsp;).</p><p><strong><br /></strong></p><p><strong>Test Data release</strong></p><p>The test data set for our evaluation campaign has been released (if you haven&rsquo;t received a notification email, please get in touch).</p><p><br />System submissions are due by <strong>March 8th, 2015</strong>.</p><p><strong><br /></strong></p><p><strong>Download </strong></p><p>You can download the sample data, release 0.1, by going&nbsp;<a href="http://multiling.iit.demokritos.gr/file/view/1541/sampledataonforums-01" title="OnForumS Dataset download">here</a>&nbsp;and&nbsp;clicking the button 'Download this' at the top right corner&nbsp;(and just in case, the initial release is still&nbsp;<a href="http://www.iit.demokritos.gr/~ggianna/MultiLing2015/sampleDataOnForumS.tgz" title="OnForumS Dataset download">here</a><span style="font-size: 12.8000001907349px;">)</span></p><p><strong><br /></strong></p><p><strong>Online Forum Summarization (OnForumS), MultiLing 2015</strong></p><p>README for the sample data release</p><p>The sample data is formed of one news article from The Guardian and a select set of readers' comments.</p><p>There are five files constituting the sample data release:<br /> 1. 81043636.ofs.in.xml<br /> 2. 81043636.ofs.out.xml<br /> 3. 81043636.utf8.txt<br /> 4. outputFormatOFS.txt<br /> 5. ofs.dtd</p><p>Participants will be expected to take file 1 as input and produce file 2 as output<br />by populating the section accordingly. File 3 is provided as an<br />auxiliary text version of the input, file 4 is a sketch of the XML format with<br />comments and file 5 is a DTD specification of the XML format. The text in file 1<br />is sentence-split and pre-tokenised (i.e., with spaces between tokens), whereas in<br />file 3 it is not.</p><p>The test data to be handed out for the final evaluation will be formed of a set of<br />news articles, where for each article there will be a pair of files, one XML file like<br />file 1 above and one auxiliary text file like file 3 above.</p><p>In addition to the data, participants will receive a validation program that they<br />can run over their outputs in order to make sure these conform with the OnForumS<br />format expectations (DTD + some specific checks, see * below for DTD validation).</p><p>Please note that the set of links provided within file 2 in order to illustrate the<br />task is a non-exhaustive set of links which was the result of pre-pilot crowdsourcing<br />evaluations using Crowd Flower.</p><p><span style="font-size: 12.8000001907349px;"><br /></span></p><p><span style="font-size: 12.8000001907349px;">--</span></p><p>* A Java DTD validator that can be used is the DOMValidator class at the following link:</p><p>http://www.herongyang.com/XML/DTD-Validation-of-XML-with-DTD-Using-DOM.html</p><p>Download the class, compile it and run it as follows:</p><p>java -Xmx1000M -Xms1000M -cpDOMValidator 81043636.ofs.out.xml</p><p><br /><br /><strong>Information</strong></p><p>For questions on OnForumS, please contact:</p><p>Mijail Kabadjov - University of Essex:&nbsp;http://privatewww.essex.ac.uk/~malexa/</p><p>Josef Steinberger - University of West Bohemia:&nbsp;http://textmining.zcu.cz/?section=member&amp;id=1</p>]]></description>
	<dc:creator>George Giannakopoulos (Admin)</dc:creator>
</item>

</channel>
</rss>