<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="/rss.css" type="text/css"?>
<rdf:RDF xmlns="http://purl.org/rss/1.0/"
    xmlns:cc="http://web.resource.org/cc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:extra="http://www.w3.org/1999/xhtml"
    xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <channel rdf:about="http://www.epjdatascience.com/feeds/latestarticles/journal?quantity=&amp;format=rss&amp;version=">
        <title>EPJ Data Science - Latest Articles</title>
        <link>http://www.epjdatascience.com</link>
        <description>The latest research articles published by EPJ Data Science</description>
        <dc:date>2013-04-04T00:00:00Z</dc:date>
        <items>
            <rdf:Seq>
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/2/1/4" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/2/1/3" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/2/1/2" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/2/1/1" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/12" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/11" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/10" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/9" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/8" />
                                <rdf:li rdf:resource="http://www.epjdatascience.com/content/1/1/7" />
                            </rdf:Seq>
        </items>
                 <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </channel>
        <item rdf:about="http://www.epjdatascience.com/content/2/1/4">
        <title>The Dynamics of Health Behavior Sentiments on a Large Online Social Network</title>
        <description>AbstractModifiable health behaviors, a leading cause of illness and death in many countries, are often driven by individual beliefs and sentiments about health and disease. Individual behaviors affecting health outcomes are increasingly modulated by social networks, for example through the associations of like-minded individuals - homophily - or through peer influence effects. Using a statistical approach to measure the individual temporal effects of a large number of variables pertaining to social network statistics, we investigate the spread of a health sentiment towards a new vaccine on Twitter, a large online social network. We find that the effects of neighborhood size and exposure intensity are qualitatively very different depending on the type of sentiment. Generally, we find that larger numbers of opinionated neighbors inhibit the expression of sentiments. We also find that exposure to negative sentiment is contagious - by which we merely mean predictive of future negative sentiment expression - while exposure to positive sentiments is generally not. In fact, exposure to positive sentiments can even predict increased negative sentiment expression. Our results suggest that the effects of peer influence and social contagion on the dynamics of behavioral spread on social networks are strongly content-dependent.</description>
        <link>http://www.epjdatascience.com/content/2/1/4</link>
                <dc:creator>Marcel Salathe</dc:creator>
                <dc:creator>Duy Vu</dc:creator>
                <dc:creator>Shashank Khandelwal</dc:creator>
                <dc:creator>David Hunter</dc:creator>
                <dc:source>EPJ Data Science 2013, null:4</dc:source>
        <dc:date>2013-04-04T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds16</dc:identifier>
                                <prism:require>/content/figures/epjds16-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>4</prism:startingPage>
        <prism:publicationDate>2013-04-04T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/2/1/3">
        <title>Word usage mirrors community structure in the online social network Twitter</title>
        <description>AbstractBackgroundLanguage has functions that transcend the transmission of information and varies with social context. To find out how language and social network structure interlink, we studied communication on Twitter, a broadly-used online messaging service.ResultsWe show that the network emerging from user communication can be structured into a hierarchy of communities, and that the frequencies of words used within those communities closely replicate this pattern. Consequently, communities can be characterised by their most significantly used words. The words used by an individual user, in turn, can be used to predict the community of which that user is a member.ConclusionsThis indicates a relationship between human language and social networks, and suggests that the study of online communication offers vast potential for understanding the fabric of human society. Our approach can be used for enriching community detection with word analysis, which provides the ability to automate the classification of communities in social networks and identify emerging social groups.</description>
        <link>http://www.epjdatascience.com/content/2/1/3</link>
                <dc:creator>John Bryden</dc:creator>
                <dc:creator>Sebastian Funk</dc:creator>
                <dc:creator>Vincent Jansen</dc:creator>
                <dc:source>EPJ Data Science 2013, null:3</dc:source>
        <dc:date>2013-02-25T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds15</dc:identifier>
                                <prism:require>/content/figures/epjds15-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>3</prism:startingPage>
        <prism:publicationDate>2013-02-25T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/2/1/2">
        <title>Lognormal distributions of  user post lengths in  Internet discussions --- a consequence of the Weber-Fechner law?</title>
        <description>The paper presents an analysis of the length of comments posted in Internet discussion fora, based on a collection of large datasets from several sources. We found that despite differences in the forum language, the discussed topics and user emotions, the comment length distributions are very regular and described by the lognormal form with a very high precision. We discuss possible origins of this regularity and the existence of a universal mechanism deciding the length of the user posts. We suggest that the observed lognormal dependence may be due to an entropy maximizing combination of two psychological factors which are perceived on a non-linear, logarithmic scale in accordance with the Weber-Fechner law, namely the time spent on post related considerations and the comment length itself. This hypothesis is supported by an experimental check of text length recognition capacity, confirming proportionality of the &#8216;just noticeable differences&#8217; for text lengths - the basis of the Weber-Fechner law.</description>
        <link>http://www.epjdatascience.com/content/2/1/2</link>
                <dc:creator>Pawel Sobkowicz</dc:creator>
                <dc:creator>Mike Thelwall</dc:creator>
                <dc:creator>Kevan Buckley</dc:creator>
                <dc:creator>Georgios Paltaglou</dc:creator>
                <dc:creator>Antoni Sobkowicz</dc:creator>
                <dc:source>EPJ Data Science 2013, null:2</dc:source>
        <dc:date>2013-02-18T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds14</dc:identifier>
                                <prism:require>/content/figures/epjds14-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>2</prism:startingPage>
        <prism:publicationDate>2013-02-18T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/2/1/1">
        <title>Realities of data sharing using the genome wars as case study - an historical perspective and commentary</title>
        <description>The importance of data sharing has become a mantra within the science research community. However, sharing has not been as easy (or as readily adopted) as advocates have suggested. Questions of privacy, individual scientist&#8217;s rights to their research, and industry-academia divides have been significant hurdles. This article looks at the history of the debates and problems associated with data access that occurred during the &#8216;human genome wars&#8217; and their aftermath as a way to explore some of the challenges facing diverse research communities.</description>
        <link>http://www.epjdatascience.com/content/2/1/1</link>
                <dc:creator>Barbara Jasny</dc:creator>
                <dc:source>EPJ Data Science 2013, null:1</dc:source>
        <dc:date>2013-02-12T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds13</dc:identifier>
                                <prism:require>/content/figures/epjds13-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>1</prism:startingPage>
        <prism:publicationDate>2013-02-12T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/12">
        <title>Link creation and information spreading over social and communication ties in an interest-based online social network</title>
        <description>AbstractComplex dynamics of social media emerge from the interaction between the patterns of social connectivity of users and the information exchanged along such social ties. Unveiling the underlying mechanisms that drive the evolution of online social systems requires a deep understanding of the interplay between these two aspects. Based on the case of the aNobii social network, an online service for book readers, we investigate the dynamics of link creation and the social influence phenomenon that may trigger information diffusion in the social graph. By confirming that social partner selection is strongly driven by structural, geographical, and topical proximity, we develop a machine-learning social link recommender for individual users trained on a set of features selected as best predictive out of several and we test it on the still widely unexplored domain of a network of interest. We also analyze the influence process from the two distinct perspectives of users and items. We show that link creation plays an immediate effect on the alignment of user profiles and that the established social ties are a good substrate for social influence. We quantitatively measure influence by tracking the patterns of diffusion of specific pieces of information and comparing them with appropriate null models. We discover an appreciable signal of social influence even though item consumption is a very slow process in this context. All the detected patterns of social attachment and influence are observed to be stronger when considering the social subgraph on which communication effectively occurs. Based on our study of the dynamics of the aNobii social network, we investigate the possibility to predict the evolution of such a complex social system.</description>
        <link>http://www.epjdatascience.com/content/1/1/12</link>
                <dc:creator>Luca Aiello</dc:creator>
                <dc:creator>Alain Barrat</dc:creator>
                <dc:creator>Ciro Cattuto</dc:creator>
                <dc:creator>Rossano Schifanella</dc:creator>
                <dc:creator>Giancarlo Ruffo</dc:creator>
                <dc:source>EPJ Data Science 2012, null:12</dc:source>
        <dc:date>2012-12-05T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds12</dc:identifier>
                                <prism:require>/content/figures/epjds12-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>12</prism:startingPage>
        <prism:publicationDate>2012-12-05T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/11">
        <title>Genetic flow directionality and geographical  segregation in a Cymodocea nodosa genetic diversity network</title>
        <description>AbstractWe analyse a large data set of genetic markers obtained from populations of Cymodocea nodosa, a marine plant occurring from the East Mediterranean to the Iberian-African coasts in the Atlantic Ocean. We fully develop and test a recently introduced methodology to infer the directionality of gene flow based on the concept of geographical segregation. Using the Jensen-Shannon divergence, we are able to extract a directed network of gene flow describing the evolutionary patterns of Cymodocea nodosa. In particular we recover the genetic segregation that the marine plant underwent during its evolution. The results are confirmed by natural evidence and are consistent with an independent cross analysis.</description>
        <link>http://www.epjdatascience.com/content/1/1/11</link>
                <dc:creator>Paolo Masucci</dc:creator>
                <dc:creator>Sophie Arnaud-Haond</dc:creator>
                <dc:creator>Victor Eguiluz</dc:creator>
                <dc:creator>Emilio Hernandez-Garcia</dc:creator>
                <dc:creator>Ester Serrao</dc:creator>
                <dc:source>EPJ Data Science 2012, null:11</dc:source>
        <dc:date>2012-11-28T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds11</dc:identifier>
                                <prism:require>/content/figures/epjds11-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>11</prism:startingPage>
        <prism:publicationDate>2012-11-28T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/10">
        <title>Spatiotemporal correlations of handset-based service usages</title>
        <description>We study spatiotemporal correlations and temporal diversities of handset-based service usages by analyzing a dataset that includes detailed information about locations and service usages of 124 users over 16 months. By constructing the spatiotemporal trajectories of the users we detect several meaningful places or contexts for each one of them and show how the context affects the service usage patterns. We find that temporal patterns of service usages are bound to the typical weekly cycles of humans, yet they show maximal activities at different times. We first discuss their temporal correlations and then investigate the time-ordering behavior of communication services like calls being followed by the non-communication services like applications. We also find that the behavioral overlap network based on the clustering of temporal patterns is comparable to the communication network of users. Our approach provides a useful framework for handset-based data analysis and helps us to understand the complexities of information and communications technology enabled human behavior.</description>
        <link>http://www.epjdatascience.com/content/1/1/10</link>
                <dc:creator>Hang-Hyun Jo</dc:creator>
                <dc:creator>Márton Karsai</dc:creator>
                <dc:creator>Juuso Karikoski</dc:creator>
                <dc:creator>Kimmo Kaski</dc:creator>
                <dc:source>EPJ Data Science 2012, null:10</dc:source>
        <dc:date>2012-11-06T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds10</dc:identifier>
                                <prism:require>/content/figures/epjds10-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>10</prism:startingPage>
        <prism:publicationDate>2012-11-06T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/9">
        <title>A large-scale community structure analysis in Facebook</title>
        <description>Understanding social dynamics that govern human phenomena, such as communications and social relationships is a major problem in current computational social sciences. In particular, given the unprecedented success of online social networks (OSNs), in this paper we are concerned with the analysis of aggregation patterns and social dynamics occurring among users of the largest OSN as the date: Facebook. In detail, we discuss the mesoscopic features of the community structure of this network, considering the perspective of the communities, which has not yet been studied on such a large scale. To this purpose, we acquired a sample of this network containing millions of users and their social relationships; then, we unveiled the communities representing the aggregation units among which users gather and interact; finally, we analyzed the statistical features of such a network of communities, discovering and characterizing some specific organization patterns followed by individuals interacting in online social networks, that emerge considering different sampling techniques and clustering methodologies. This study provides some clues of the tendency of individuals to establish social interactions in online social networks that eventually contribute to building a well-connected social structure, and opens space for further social studies.</description>
        <link>http://www.epjdatascience.com/content/1/1/9</link>
                <dc:creator>Emilio Ferrara</dc:creator>
                <dc:source>EPJ Data Science 2012, null:9</dc:source>
        <dc:date>2012-11-06T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds9</dc:identifier>
                                <prism:require>/content/figures/epjds9-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>9</prism:startingPage>
        <prism:publicationDate>2012-11-06T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/8">
        <title>Beating the news using Social Media: the case study of American Idol</title>
        <description>We present a contribution to the debate on the predictability of social events using big data analytics. We focus on the elimination of contestants in the American Idol TV shows as an example of a well defined electoral phenomenon that each week draws millions of votes in the USA. This event can be considered as basic test in a simplified environment to assess the predictive power of Twitter signals. We provide evidence that Twitter activity during the time span defined by the TV show airing and the voting period following it correlates with the contestants ranking and allows the anticipation of the voting outcome. Twitter data from the show and the voting period of the season finale have been analyzed to attempt the winner prediction ahead of the airing of the official result. We also show that the fraction of tweets that contain geolocation information allows us to map the fanbase of each contestant, both within the US and abroad, showing that strong regional polarizations occur. The geolocalized data are crucial for the correct prediction of the final outcome of the show, pointing out the importance of considering information beyond the aggregated Twitter signal. Although American Idol voting is just a minimal and simplified version of complex societal phenomena such as political elections, this work shows that the volume of information available in online systems permits the real time gathering of quantitative indicators that may be able to anticipate the future unfolding of opinion formation events.</description>
        <link>http://www.epjdatascience.com/content/1/1/8</link>
                <dc:creator>Fabio Ciulla</dc:creator>
                <dc:creator>Delia Mocanu</dc:creator>
                <dc:creator>Andrea Baronchelli</dc:creator>
                <dc:creator>Bruno Goncalves</dc:creator>
                <dc:creator>Nicola Perra</dc:creator>
                <dc:creator>Alessandro Vespignani</dc:creator>
                <dc:source>EPJ Data Science 2012, null:8</dc:source>
        <dc:date>2012-07-31T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds8</dc:identifier>
                                <prism:require>/content/figures/epjds8-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>8</prism:startingPage>
        <prism:publicationDate>2012-07-31T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <item rdf:about="http://www.epjdatascience.com/content/1/1/7">
        <title>Crowd disasters as systemic failures: analysis of the Love Parade disaster</title>
        <description>AbstractEach year, crowd disasters happen in different areas of the world. How and why do such disasters happen? Are the fatalities caused by relentless behavior of people or a psychological state of panic that makes the crowd &#8216;go mad&#8217;? Or are they a tragic consequence of a breakdown of coordination? These and other questions are addressed, based on a qualitative analysis of publicly available videos and materials, which document the planning and organization of the Love Parade in Duisburg, Germany, and the crowd disaster on July 24, 2010. Our analysis reveals a number of misunderstandings that have widely spread. We also provide a new perspective on concepts such as &#8216;intentional pushing&#8217;, &#8216;mass panic&#8217;, &#8216;stampede&#8217;, and &#8216;crowd crushes&#8217;. The focus of our analysis is on the contributing causal factors and their mutual interdependencies, not on legal issues or the judgment of personal or institutional responsibilities. Video recordings show that people stumbled and piled up due to a &#8216;domino effect&#8217;, resulting from a phenomenon called &#8216;crowd turbulence&#8217; or &#8216;crowd quake&#8217;. Crowd quakes are a typical reason for crowd disasters, to be distinguished from crowd disasters resulting from &#8216;mass panic&#8217; or &#8216;crowd crushes&#8217;. In Duisburg, crowd turbulence was the consequence of amplifying feedback and cascading effects, which are typical for systemic instabilities. Accordingly, things can go terribly wrong in spite of no bad intentions from anyone. Comparing the incident in Duisburg with others, we give recommendations to help prevent future crowd disasters. In particular, we introduce a new scale to assess the criticality of conditions in the crowd. This may allow preventative measures to be taken earlier on. Furthermore, we discuss the merits and limitations of citizen science for public investigation, considering that today, almost every event is recorded and reflected in the World Wide Web.</description>
        <link>http://www.epjdatascience.com/content/1/1/7</link>
                <dc:creator>Dirk Helbing</dc:creator>
                <dc:creator>Pratik Mukerji</dc:creator>
                <dc:source>EPJ Data Science 2012, null:7</dc:source>
        <dc:date>2012-06-25T00:00:00Z</dc:date>
        <dc:identifier>doi:10.1140/epjds7</dc:identifier>
                                <prism:require>/content/figures/epjds7-toc.gif</prism:require>
                <prism:publicationName>EPJ Data Science</prism:publicationName>
        <prism:issn>2193-1127</prism:issn>
        <prism:volume>${item.volume}</prism:volume>
        <prism:startingPage>7</prism:startingPage>
        <prism:publicationDate>2012-06-25T00:00:00Z</prism:publicationDate>
                <prism:versionidentifier>XML</prism:versionidentifier>
                <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/" />
    </item>
        <cc:License rdf:about="http://creativecommons.org/licenses/by/2.0/">
        <cc:permits rdf:resource="http://creativecommons.org/ns#Reproduction" />
        <cc:permits rdf:resource="http://creativecommons.org/ns#Distribution" />
        <cc:permits rdf:resource="http://creativecommons.org/ns#DerivativeWorks" />
    </cc:License>
</rdf:RDF>
