<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nicolasneubauer.net</title>
	<atom:link href="http://www.nicolasneubauer.net/wordpress/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nicolasneubauer.net/wordpress</link>
	<description></description>
	<lastBuildDate>Tue, 12 Jul 2011 14:43:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google+: Your privacy still is an illusion</title>
		<link>http://www.nicolasneubauer.net/wordpress/2011/07/05/google-your-privacy-still-is-an-illusion/</link>
		<comments>http://www.nicolasneubauer.net/wordpress/2011/07/05/google-your-privacy-still-is-an-illusion/#comments</comments>
		<pubDate>Tue, 05 Jul 2011 16:02:42 +0000</pubDate>
		<dc:creator>nn</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[privacy]]></category>

		<guid isPermaLink="false">http://www.nicolasneubauer.net/wordpress/?p=64</guid>
		<description><![CDATA[Short version: Don&#8217;t think that circles turn sharing sensitive stuff into a good idea. First of all, Google+ must be applauded for attacking the delicate problem of controlling who sees what of the information to put online. The concept of circles isn&#8217;t significantly different from Facebook&#8217;s friend groups &#8211; but Google+ makes this aspect central [...]]]></description>
			<content:encoded><![CDATA[<p>Short version: Don&#8217;t think that circles turn sharing sensitive stuff into a good idea.<span id="more-64"></span></p>
<p>First of all, Google+ must be applauded for attacking the delicate problem of controlling who sees what of the information to put online. The concept of circles isn&#8217;t significantly different from Facebook&#8217;s friend groups &#8211; but Google+ makes this aspect central both when creating new contacts and when posting new content. This is more than an interface tweak &#8211; defaults matter (*). As a result, the semantics of Google+ &#8220;friendships&#8221; in my opinion already differ from those on Facebook, seeing how people add me to their circles who would never request a Facebook friendship (e.g., unknown friends of friends). However, this new-found privacy may lull people into a false sense of security: <strong>Excluding someone in your circles from something you share does not mean they won&#8217;t ever see it.</strong> They just won&#8217;t see it from you. </p>
<p>This may be obvious. It wasn&#8217;t for me, so I thought I&#8217;ll share a little experiment. Consider you have two friends, A &#8211; your friend, and B &#8211; your business contact. You share an update which is only intended for your friends, so you exclude the circle including B. A finds this so hilarious he shares it with all of his contacts. These contacts happen to include &#8211; guess what &#8211; your business aquaintance B. Will B see your update? Yes, she will. You may think &#8220;but I told Google+ I don&#8217;t want her to see this&#8221;. No, you didn&#8217;t! You just stated that she won&#8217;t be in the set of people who will receive that information from you (**). Any attachment you put on that post will be generously forwarded to B as well. To add further awkwardness, B will see that A was included in the circles of the original post, but she wasn&#8217;t, as she&#8217;s getting a &#8220;A shared [your name] post with you&#8221; message. Ouch! In Google+&#8217;s defense, you have to say that a) re-sharers get a pop-up telling them that the original post was shared in a limited scope and b) you can disable re-sharing. However, the re-sharer may not care, and the re-share disabling is an option available in a drop-down menu which appears after you post (i.e. much less prominent than the circle selection, and we know what happens to non-default functions). </p>
<p>To conclude: Circles may be misunderstood by some users (e.g., me &#8211; I&#8217;d love some comments from readers telling me how far off I am here) as serving a purpose which, in fact, can only achieved in a brute-force way via &#8220;disable re-sharing&#8221;. Solutions would include users being aware of this, Google+ deemphasizing the privacy power of circles, and/or making the &#8220;disable re-share&#8221; button more salient (or change the semantics to &#8220;people I haven&#8217;t shared this with are excluded from future re-shares&#8221;). Of course: Privacy is hard, and particularly in a social network, giving up some of your privacy is part of why it works. All the more, it&#8217;s important for existing privacy features to be clear, otherwise they may cause problems you never would&#8217;ve had, had you simply not relied on them at all and just applied the basic heuristic for putting stuff on the internet: Don&#8217;t post anything you wouldn&#8217;t like to see on the cover of tomorrow&#8217;s newspaper!</p>
<p>This post is based on a discussion with Oli T. and stars Crisi K. in the role of B.</p>
<p>(*) Which is why <a href="http://www.circlehack.com/">Circle Hack</a>, a Facebook app copying Google+&#8217;s interface for assigning friends into groups, doesn&#8217;t really close the gap between the two.</p>
<p>(**) It does make sense, not stating X is something different than stating that not X, just be clear that removing a circle does the former.</p>
<p>Update: <a href="http://stefan.waidele.info/2011/07/08/google-plus-und-der-privatspharenirrtum/" title="At least one other person finds this behaviour noteworthy as well (in German)" target="_blank">At least one other person finds this behaviour noteworthy as well (in German)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.nicolasneubauer.net/wordpress/2011/07/05/google-your-privacy-still-is-an-illusion/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualizing the contents of social bookmarking systems</title>
		<link>http://www.nicolasneubauer.net/wordpress/2010/12/15/visualizing_social_bookmarks/</link>
		<comments>http://www.nicolasneubauer.net/wordpress/2010/12/15/visualizing_social_bookmarks/#comments</comments>
		<pubDate>Wed, 15 Dec 2010 14:01:41 +0000</pubDate>
		<dc:creator>nn</dc:creator>
				<category><![CDATA[research]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.nicolasneubauer.net/wordpress/?p=37</guid>
		<description><![CDATA[I&#8217;ve decided to post a few visualizations I&#8217;ve made some time ago as part of my PhD work. The method has been published, but more as a sidenote [1]; also, I&#8217;ve since applied it to additional datasets, so I thought it might be interesting to share those images. In my PhD thesis, I try to [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve decided to post a few visualizations I&#8217;ve made some time ago as part of my PhD work. <span id="more-37"></span>The method has been published, but more as a sidenote [1]; also, I&#8217;ve since applied it to additional datasets, so I thought it might be interesting to share those images.</p>
<p>In my PhD thesis, I try to make sense of the large bodies of data that are accumulated by people saving resources online, tagging them with whatever words they choose in order to find them later on. Each time a user <em>u</em> saves a document <em>d</em>, using the tags <em>t1</em>, <em>t2</em>, and <em>t3</em>, three triples (<em>d,u,t1</em>), (<em>d,u,t2</em>), (<em>d,u,t3</em>) are created. In this way, the users of large social bookmarking sites like <a href="http://www.delicious.com">Delicious</a> have created datasets of several hundreds of millions of triples. Over the last years, a whole body of literature has been created that&#8217;s concerned with making sense of this data (there&#8217;s an intuition that something valuable is in there; after all, each of these millions of triples means that somebody has thought *something*!). Here, I want to show a few visualizations of various social bookmarking datasets aimed to provide a quick idea of the complexity and the approximate content of these networks.</p>
<p>Here&#8217;s what I&#8217;ve done:</p>
<ul>
<li>Take the 1000 most popular documents from each dataset</li>
<li>Compute the similarity between them in terms of associated tags</li>
<li>Draw connections between the 3000 closest pairs of documents</li>
<li>Draw connections between each document and its most frequently associated tag</li>
<li>Scale the tags by frequency</li>
<li>Remove everything that&#8217;s not connected to the largest component of the graph</li>
<li>Run the resulting network through <a href="http://www.graphviz.org/">GraphViz</a>, a fantastic graph visualization software</li>
</ul>
<p>For the technically inclined, the similarity between documents is computed as a so-called cosine similarity: Each document is represented by a large vector containing, for each tag, the number of users that have used that tag to describe the document. The cosine similarity between those vectors is then a number between 0 (no similarity) and 1 (complete match). If you imagine two vectors (0,1) and (1,0), the angle between them is 90 degrees and the cosine of that angle is 0. This can be scaled up to many dimensions and is frequently used in information retrieval as a basic similarity measure between measure. Also, the choice of the numbers 1000 and 3000 is somewhat arbitrary. They just happen to create what I find the most expressive visualizations over all datasets. The actual visualization, as noted, is outsourced to GraphViz; for a quick intuition about what it does, just imagine that each connection is a spring between two nodes &#8211; the visualization is a stable state of the resulting system where the different spring forces are in balance.</p>
<p><a href="http://www.visualize.us">Visualize.us</a>, a social bookmarking site for images.<br />
<a href="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/visualizeus.png"><img class="size-medium wp-image-42" title="Visualize.us" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/visualizeus-300x207.png" alt="" width="300" height="207" /></a></p>
<p>One month of <a href="http://www.delicious.com">Delicious</a> (12/2007) [2], a social bookmarking service for URLs<br />
<a href="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/delicious.png"><img class="size-medium wp-image-41" title="Delicious 12/2008" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/delicious-300x285.png" alt="" width="300" height="285" /></a></p>
<p><a href="http://www.citeulike.org">CiteULike</a> [3], a social bookmarking service for scientific articles<br />
<a href="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/citeulike.png"><img class="size-medium wp-image-40" title="CiteULike" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/citeulike-300x264.png" alt="" width="300" height="264" /></a></p>
<p><a href="http://www.bibsonomy.org">Bibsonomy</a> [4], a social bookmarking service for scientific articles and URLS<br />
<a href="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/bibsonomy_nospam.png"><img class="size-medium wp-image-38" title="Bibsonomy (without spam)" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/bibsonomy_nospam-300x296.png" alt="" width="300" height="296" /></a></p>
<p><a href="http://www.bibsonomy.org">Bibsonomy</a>, without the spam removed<br />
<a href="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/bibsonomy_spam.png"><img class="size-medium wp-image-39" title="Bibsonomy (with spam)" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/12/bibsonomy_spam-300x264.png" alt="" width="300" height="264" /></a></p>
<p>Through the way they are created, these visualizations are heavily biased towards the most popular documents, so a lot of content is lost &#8211; the original point of these visualizations was just to demonstrate the brutal way in which spammers destroy inherent patterns in the data (even though I think there&#8217;s more things to find). For work more aimed at a complete overview of a particular dataset, please check <a href="http://cns.slis.indiana.edu/research/09-Bibsomony.jpg">this</a> nice visualization[5].</p>
<p><small>Higher resolution or vectorized versions available on request; if you would like to cite this work in an academic context, please refer to [1]</small></p>
<p>[1] Nicolas Neubauer and Klaus Obermayer: Hyperincident Connected Components in Tagging Networks. In <em>Proceedings of the <a href="http://www.ht2009.org/">20th ACM Conference on Hypertext and Hypermedia</a> </em> [<a href="http://user.cs.tu-berlin.de/~neubauer/hyperincident_components.pdf">pdf</a>]<br />
[2] Robert Wetzker, Carsten Zimmermann, Christian Bauckhage: Analyzing Social Bookmarking Systems: A del.icio.us Cookbook. In <em>Proceedings of the <a href="http://mlkd.csd.auth.gr/msoda/">ECAI 2008 Mining Social Data Workshop (2008)</a></em>, pp. 26-30.<br />
[3] <a href="http://www.citeulike.org/faq/data.adp">http://www.citeulike.org/faq/data.adp</a><br />
[4] Knowledge &amp; Data Engineering Group, University of Kassel Benchmark folksonomy data from bibsonomy, version of june 30th, 2008.<br />
[5] Nianli Ma, Russell J. Duhon, Elisha F. Hardy, Katy Börner (2009) Bibsonomy Anatomy, Sunbelt Viszards Map. Online at <a href="http://cns.iu.edu/research/09-Bibsomony.jpg" target="_blank">http://cns.iu.edu/research/09-Bibsomony.jpg</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.nicolasneubauer.net/wordpress/2010/12/15/visualizing_social_bookmarks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Workshop on Information in Networks 2010</title>
		<link>http://www.nicolasneubauer.net/wordpress/2010/10/06/workshop-on-information-in-networks-2010/</link>
		<comments>http://www.nicolasneubauer.net/wordpress/2010/10/06/workshop-on-information-in-networks-2010/#comments</comments>
		<pubDate>Wed, 06 Oct 2010 22:20:37 +0000</pubDate>
		<dc:creator>nn</dc:creator>
				<category><![CDATA[research]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[travel]]></category>

		<guid isPermaLink="false">http://www.nicolasneubauer.net/wordpress/?p=8</guid>
		<description><![CDATA[A few days ago, I returned from &#8220;Workshop in Information in Networks&#8221;, where I had presented our paper Community Detection in Tagging-Induced Hypergraphs. It was a two-day workshop with an informal feel to it, celebrating interdisciplinarity and featuring research from physicists and historians, computer and political scientists all while being hosted at NYU&#8217;s Stern School [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_9" class="wp-caption alignleft" style="width: 250px"><a href="http://www.flickr.com/photos/alphachimpstudio/5022262888/in/set-72157625028173596/"><img class="size-medium wp-image-9 " title="Graphical Recording of my talk at WIN'10 (by AlphaChimps)" src="http://www.nicolasneubauer.net/wordpress/wp-content/uploads/2010/10/winchimps-300x133.jpg" alt="" width="240" height="106" /></a><p class="wp-caption-text">Graphical Recording of my talk at WIN&#39;10. (by AlphaChimps)</p></div>
<p>A few days ago, I returned from <a title="Workshop in Information in Networks" href="http://www.winworkshop.net/" target="_blank">&#8220;Workshop in Information in Networks&#8221;</a>, <span id="more-8"></span>where I had presented our paper <a href="http://ni.cs.tu-berlin.de/%7Eneubauer/win.pdf" target="_blank">Community Detection in Tagging-Induced Hypergraphs</a>.  It was a two-day workshop with an informal feel to it, celebrating  interdisciplinarity and featuring research from physicists and  historians, computer and political scientists all while being hosted at  NYU&#8217;s Stern School of Business. <!--more--></p>
<p>Short snapshot of what comes to my mind after almost two weeks now after browsing the program again (citations without guarantees):</p>
<ul>
<li>Duncan Watts on cascading in Twitter: &#8220;Almost nothing cascades&#8221;</li>
<li>Nathan Eagle on real-world relevance of research: &#8220;The really hard questions come from people who want to know where to build a fountain to minimize diseases&#8221; (see also AI for Development &#8211; <a href="http://" target="_blank">http://ai-d.org/</a>)</li>
<li>Stanley Wasserman polarizing on network researchers neglecting the roots of their field and hypothesis-driven (statistics) vs data-driven (data mining) research</li>
<li>Jeff Nickerson employing Mechanical Turk for crowdsourced brainstorming</li>
<li>Isabelle Stanton demonstrating the construction of random graphs with a given joint degree distribution</li>
<li>Michael Kearns on coordination problems in networks, showing through elegant experiments how, e.g., &#8220;a well-connected, small elite can enforce their will against an unconnected majority&#8221;</li>
<li>Neha Gondal showing how a network dataset of loans in renaissance Florance could be used to test various theories of the social dynamics of lending in that time</li>
<li>Peter Boothe showing that 18 autonomous systems (ie ISPs etc) could together kill around 40% (or at least some ridiculously high number around there) of all internet traffic by turning off their systems</li>
<li>Sinan Aral showing that susceptibility to peer pressure (as measured by following a Facebook recommendation) increases as the relationship status changes from single to married (most susceptible, though, are &#8220;it&#8217;s complicated&#8221;-customers)</li>
</ul>
<p>One nice feature of the event was the facilitator/graphical recorder from http://www.alphachimp.com who made funny but amazingly effective sketches of the talks for documentation and later discussion. Above you find the recording of my talk, linking to the full set of recordings from the event at Flickr.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nicolasneubauer.net/wordpress/2010/10/06/workshop-on-information-in-networks-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The unavoidable first post</title>
		<link>http://www.nicolasneubauer.net/wordpress/2010/10/06/hello-world/</link>
		<comments>http://www.nicolasneubauer.net/wordpress/2010/10/06/hello-world/#comments</comments>
		<pubDate>Wed, 06 Oct 2010 19:34:58 +0000</pubDate>
		<dc:creator>nn</dc:creator>
				<category><![CDATA[meta]]></category>

		<guid isPermaLink="false">http://www.nicolasneubauer.net/wordpress/?p=1</guid>
		<description><![CDATA[So I went back to putting some content on this page. I aim to provide some insight into my ongoing research, store some bits of technical information that might otherwise be lost, and will probably not be able to resist the occasional rant. We&#8217;ll see. Btw you will find I put some advertising on the [...]]]></description>
			<content:encoded><![CDATA[<p>So I went back to putting some content on this page.<span id="more-1"></span> I aim to provide some insight into my ongoing research, store some bits of technical information that might otherwise be lost, and will probably not be able to resist the occasional rant. We&#8217;ll see.  <!--more--></p>
<p>Btw you will find I put some advertising on the page. I wasn&#8217;t originally planning on this and also don&#8217;t expect to become rich by this in the near future, but the slick option editor of WordPress and this beautiful theme called <a title="Graphene" href="http://www.khairul-syahir.com/wordpress-dev/graphene-theme" target="_blank">http://www.khairul-syahir.com/wordpress-dev/graphene-theme</a> made it so simple I just had to try it out. Ah, well&#8230;. (same goes for all the social media bling ;)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nicolasneubauer.net/wordpress/2010/10/06/hello-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

