Difference between revisions of "Mark my words!"

Latest revision as of 01:07, 2 October 2012

Citation

Cristian Danescu-Niculescu-Mizil, Michael Gamon, Susan T. Dumais: Mark my words!: linguistic style accommodation in social media. WWW 2011: 745-754

Online version

http://research.microsoft.com/en-us/um/people/sdumais/fr862-danescu-niculescu-mizil.pdf

Summary

Physiological studies have suggested that participants in conversation accommodate in dimensions such as speaking style, utterance length, gesture, speaking rate etc. In this paper authors proves the hypothesis that linguistic accommodation could be seen in social media such as Twitter. They investigate accommodation in LIWC dimensions[1]. Some examples of these dimensions include use of article,negation words(not/no),preposition,quantifier,1st person singular pronoun,1st person plural pronoun,2nd person pronoun in conversation. Authors propose a novel probabilistic framework to prove their hypothesis.

Framework

When individual talk about some topic, they would have to use similar words to describe topics hence it is important to remove topic accommodation from overall accommodation measure. Since they use LIWC dimensions, it is automatically removed.

Their framework is based on mainly two components, stylistic cohesion and stylistic accommodation.

Stylstic Cohesion: It is used to find if tweets belonging to same conversation exhibit a certain LIWC style more than tweets which are unrelated. If the former is more then we can say that tweets which are part of same conversation agree more on a particular style. Formally, for a style $C$ it is defined as:

$Coh(C)=P(T^{C}\wedge R^{C}|T\leftrightarrow R)-P(T^{C}\wedge R^{C})$

where $T\leftrightarrow R$ is condition which represent that tweets are from same conversation. $P(T^{C}\wedge R^{C}|T\leftrightarrow R)$ is the probability of tweets which are part of same conversation and exhibit style C. Whereas, $P(T^{C}\wedge R^{C})$ is probability of observing style C in any randomly picked two tweets.

Stylistic accommodation:

While measuring stylistic accommodation it is assumed that a twitter can accommodate in a style with his partner only if his partner exhibited style C in same conversation earlier. The formal definition of stylistic definition is as follows:

$Acc(a;b)^{C}=P(T_{b}^{C}|T_{a}^{C},T_{b}\rightarrow T_{a})-P(T_{b}^{C}|T_{b}\rightarrow T_{a})$

Here, $P(T_{b}^{C}|T_{a}^{C},T_{b}\rightarrow T_{a})$ represents the probability that style $C$ was exhibited in tweets of user b after observing the same style in user $a$ . Whereas, $P(T_{b}^{C}|T_{b}\rightarrow T_{a})$ represent that style C was observed in user b irrespective of whether user a used the style C or not. Note the fact that $Acc(a;b)$ is directional accommodation from a to b. They also defines accommodation from b to a. They use these two accommodation scores $Acc(a;b)$ and $Acc(b;a)$ to find if accommodation is symmetric or not.

Results:

Authors observe that $P(T^{C}\wedge R^{C}|T\leftrightarrow R)$ is more than $P(T^{C}\wedge R^{C})$ in considered LIWC styles. This confirms the fact that Stylistic Cohesion is present in Twitter. They also observe that $P(T_{b}^{C}|T_{a}^{C},T_{b}\rightarrow T_{a})$ is more than $P(T_{b}^{C}|T_{b}\rightarrow T_{a})$ . This confirms that linguistic accommodation in LIWC styles is present in twitter.

Related Paper:

Rivka Levitan, Agustín Gravano, Julia Hirschberg: Entrainment in Speech Preceding Backchannels. ACL (Short Papers) 2011: 113-117
Rivka Levitan, Julia Hirschberg: Measuring Acoustic-Prosodic Entrainment with Respect to Multiple Levels and Dimensions. INTERSPEECH 2011: 3081-3084

@@ Line 8: / Line 8: @@
 http://research.microsoft.com/en-us/um/people/sdumais/fr862-danescu-niculescu-mizil.pdf
-== Summary ==
+==Summary ==
-Physiological studies have suggested that participants in conversation accommodate in dimensions such as speaking style, utterance length, gesture, speaking rate etc. In this paper authors proves the hypothesis that linguistic  accommodation could also be seen in social media such as Twitter. They investigate accommodation in LIWC dimensions[http://www.liwc.net/liwcdescription.php].Some of examples for these dimensions include use of Article,Negation words(not/no),Preposition,Quantifier,1st person singular pronoun,1st person plural pronoun,2nd person pronoun in conversation. Authors propose a novel probabilistic framework to prove their hypothesis.
+Physiological studies have suggested that participants in conversation accommodate in dimensions such as speaking style, utterance length, gesture, speaking rate etc. In this paper authors proves the hypothesis that linguistic  accommodation could be seen in social media such as Twitter. They investigate accommodation in LIWC dimensions[http://www.liwc.net/liwcdescription.php]. Some examples of these dimensions include use of article,negation words(not/no),preposition,quantifier,1st person singular pronoun,1st person plural pronoun,2nd person pronoun in conversation. Authors propose a novel probabilistic framework to prove their hypothesis.
@@ Line 21: / Line 21: @@
 '''Stylstic Cohesion:'''
 It is used to find if tweets belonging to same conversation exhibit a certain LIWC style more than tweets which are unrelated. If the former is more then we can say that tweets which are part of same conversation agree more on a particular style. Formally, for a style <math> C </math> it is defined as:
 <math>Coh(C) =P(T^C \wedge  R^C | T \leftrightarrow  R)-P(T^C \wedge  R^C)</math>
-where <math> T \leftrightarrow  R </math> is condition which represent that tweets represent a conversation turn. P(T^C \wedge  R^C | T \leftrightarrow  R) is the probability that tweets which are part of same conversation and exhibit the style C. Whereas, P(T^C \wedge  R^C) represents that any randomly picked two tweets exhibit the style C.
+where <math> T \leftrightarrow  R </math> is condition which represent that tweets are from same conversation.  <math> P(T^C \wedge  R^C | T \leftrightarrow  R) </math> is the probability of tweets which are part of same conversation and exhibit style C. Whereas, <math>P(T^C \wedge  R^C) </math> is probability of observing style C in any randomly picked two tweets.
-'''stylistic accommodation:'''
-While measuring stylistic accommodation it is assumed that a twitter can accommodate in a style <math>C</math> with his partner only if his partner exhibited style <math>C</same> in same conversation earlier. The formal definition of stylistic definition is as follows:
+'''Stylistic accommodation:'''
+While measuring stylistic accommodation it is assumed that a twitter can accommodate in a style with his partner only if his partner exhibited style C in same conversation earlier. The formal definition of stylistic definition is as follows:
 <math>Acc(a;b)^C =P(T_b^C|T_a^C,T_b\rightarrow  T_a)-P(T_b^C|T_b\rightarrow  T_a)</math>
-Here, P(T_b^C|T_a^C,T_b\rightarrow  T_a) represents the probability that style <math>C</math> was exhibited in tweets of user b after observing the same style in user <math>a</math>. Whereas <math> P(T_b^C|T_b\rightarrow  T_a) </math> represent that style C was observed in user b irrespective of whether a used the similar style or not. Note the fact that <math>Acc(a;b)</math> is accommodation from a to b. They also defines accommodation from b to a. They use these two accommodation scores <math>Acc(a;b) </math> and <math>Acc(b;a) </math> to find if accommodation is symmetric or not.
+Here, <math>P(T_b^C|T_a^C,T_b\rightarrow  T_a)</math> represents the probability that style <math>C</math> was exhibited in tweets of user b after observing the same style in user <math>a</math>. Whereas, <math> P(T_b^C|T_b\rightarrow  T_a) </math> represent that style C was observed in user b irrespective of whether user a used the style C or not. Note the fact that <math>Acc(a;b)</math> is directional accommodation from a to b. They also defines accommodation from b to a. They use these two accommodation scores <math>Acc(a;b) </math> and <math>Acc(b;a) </math> to find if accommodation is symmetric or not.
 ==Results: ==
-They observe that <math>P(T^C \wedge  R^C | T \leftrightarrow  R)</math> is more than <math>P(T^C \wedge  R^C)</math> in the LIWC styles that considered. This confirms the fact that Stylistic Cohesion is present in twitter. They also observe that <math>P(T_b^C|T_a^C,T_b\rightarrow  T_a)</math> is more than <math>P(T_b^C|T_b\rightarrow  T_a)</math>. This confirms that accommodation is present in twitter.
+Authors observe that <math>P(T^C \wedge  R^C | T \leftrightarrow  R)</math> is more than <math>P(T^C \wedge  R^C)</math> in considered LIWC styles. This confirms the fact that Stylistic Cohesion is present in Twitter. They also observe that <math>P(T_b^C|T_a^C,T_b\rightarrow  T_a)</math> is more than <math>P(T_b^C|T_b\rightarrow  T_a)</math>. This confirms that linguistic accommodation in LIWC styles is present in twitter.
-Related Paper:
+==Related Paper: ==
 * Rivka Levitan, Agustín Gravano, Julia Hirschberg: Entrainment in Speech Preceding Backchannels. ACL (Short Papers) 2011: 113-117
 * Rivka Levitan, Julia Hirschberg: Measuring Acoustic-Prosodic Entrainment with Respect to Multiple Levels and Dimensions. INTERSPEECH 2011: 3081-3084

Difference between revisions of "Mark my words!"

Latest revision as of 01:07, 2 October 2012

Contents

Citation

Online version

Summary

Framework

Results:

Related Paper:

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools