Using Google Insights to Track Linguistic Communities (6)
share
digg
by
Matthew Hurst (47)
on
Data Mining: Text Mining, Visualization and Social Media (45)
1 month, 3 weeks
ago
permalink
There are many fun things you can do with Google Insights. Putting in a term in a specific language, one can get a picture of where in the world people speaking that language live. Here are some examples (I used the fish for most translations). Russian: пицца (pizza) Japanese: 寿司 (sushi) Korean: 피자 (pizza) Dutch: bioskoop (cinema) etc.
-
George Tziralis said:
give google insights a try for "ελληνικά" to see how world-wide-spread the greek communities are :)
Stanford Superconducting Supersocializer seeks elementary particles of social interaction (1)
share
digg
by
Tim Finin (6)
on
UMBC ebiquity (6)
2 months, 3 weeks
ago
permalink
Last month we blogged about UMBC Sociologist Zeynep Tufekci who views social media from the perspective of social physics. Sociologist Kieran Healy puts the ideas together with a different result, as his post on discovering the elementary particles of social interaction) shows. (Spotted in a post by Mark Liberman on Language Log) “I’m looking forward to spending a bit of time at the Center for Advanced Study in the Behavioral Sciences this fall. Think of ...
If Google had Semantic Technology… (1)
share
digg
by
Dr. Riza C Berkan, CEO (0)
on
hakia Blog (0)
2 months, 3 weeks
ago
permalink
I am writing this blog article in response to the mention of hakia in the NYTimes blog article written by Saul Hansell, who is praising Google’s technology in wake of Google’s declining profits and 12% drop in its share price. Mr. Saul Hansell and I had an exciting conversation about the future of Web search. In comparison to Google, our point was, and always is, that semantic search is not an option, rather it is ...
Do you think because you write, or write because you think? (2)
share
digg
by
Daniel Lemire (16)
on
Daniel Lemire's blog (29)
3 months
ago
permalink
I used to believe that the pressure to publish what you did in research was inherently bad. About four years ago or so, I started to change my mind. I now believe that the more you write, the more you think about the issues, and the more ideas you have. In short, productive researchers do not write a lot because they are brilliant, they are brilliant because they write a lot. This statement has counterexamples, ...
Yoopick: A sports prediction contest on Facebook with a research twist (1)
share
digg
by
David Pennock (7)
on
Oddhead Blog (7)
3 months, 1 week
ago
permalink
I’m happy to announce the public beta launch of Yoopick, a sports prediction contest with a twist. You pick any range you think the score difference or point spread of the game will fall into, for example you might pick Pittsburgh wins by between 2 and 11 points. The more your prediction is viewed as unlikely by others, and the more you’re willing to stake on your prediction, the more you stand to gain. Of ...
Stacked Agents Model (1)
share
digg
3 months, 1 week
ago
permalink
This is research I did a while ago and presented Monday to fulfill the requirements of my Masters degree. The presentation only needed to be about 20 minutes, so it was a very short intro. We have moved on since then, so when I say future work, I really mean future work. The post is rather lengthy, so I have moved the main content below the jump. Recommender System Types Recommender systems come in two ...
-
Jason Adams said:
Sharing my own stuff!? Shameless self promotion.
On Friends, Followers, and the Top Twitter Users (2)
share
digg
by
Ben Lorica (13)
on
O'Reilly Radar - Insight, analysis, and research about emerging technologies (671)
3 months, 1 week
ago
permalink
An easy way to increase Twitter's signal-to-noise ratio is to follow less people. I'm sure you've heard of Twitter users who follow several thousand twitterers. How they keep up with that many micro-blogs is beyond me. Unfortunately, spammers have discovered that to increase their "following", they simply follow thousands of other users, a small percentage whom will politely start following them. Granted, number of followers is not as informative as the number of conversations a ...
The Tyranny of Five Stars (1)
share
digg
by
samidh (0)
on
Pluriblog (0)
3 months, 2 weeks
ago
permalink
In our last post, we talked about how “summary” is a much better way than “search” for people to make sense of opinions on the internet. There is, of course, one form of summary that already exists and is ubiquitous across the web: The Five Star system. There’s only one problem with it. The Five Star system sucks. The major problem with the Five Star system is that it can only tell you whether people ...
Code: Calculating Conditonal Entropy (1)
share
digg
by
dpn (0)
on
david petar novakovic: attempted axiomatisation (0)
3 months, 2 weeks
ago
permalink
I’m making the python code I’m using to do my evaluation of my tests results in my research available for anyone to use in their system. The code is designed to calculate the conditional entropy of a series of classifications stored in a confusion matrix. The code is an implementation of the ideas presented by Neill in his paper Fully Automatic Word Sense Induction using Semantic Clustering. I’ve included some short tests and documentation in ...
Google Makes us Stupid (1)
share
digg
by
Andre Vellino (0)
on
Synthèse (0)
3 months, 2 weeks
ago
permalink
So far, I’ve liked everything I’ve seen and read by Nicholas Carr (author of “The Big Switch: rewiring the world from Edison to Google”). I was interested and challenged by his recent article in The Atlantic “Is Google Making us Stupid“. His basic thesis is that the information overload that results from the availability of huge amounts of data from search engines is making us unable to read closely and think deeply. As part of ...
Why SMS GupShup is Bigger than Twitter (4)
share
digg
by
anand_rajaraman (23)
on
Datawocky (23)
3 months, 3 weeks
ago
permalink
Matt Marshall at VentureBeat liked my post on SMS GupShup, and asked me to write a follow-up guest post for VB. That post appears on VentureBeat today. Leaving aside questions of technology and scaling, I ask why SMS GupShup is bigger and growing faster than Twitter. My hypothesis: Microblogging is a nice-to-have in developed economies, like the US. It's a must-have in developing economies like India, China, and Egypt.In essence, microblogging is semi-synchronous publish-subscribe messaging. ...
Angel, VC, or Bootstrap? (3)
share
digg
by
anand_rajaraman (23)
on
Datawocky (23)
4 months
ago
permalink
Note: I wrote this piece a couple of weeks back, inspired by Greg Linden's blog post (see below). Inc then picked up the piece and asked me not to publish it until it appeared on the Inc website. The article appears on the Inc website today with some minor edits.Greg Linden was one of the key developers behind Amazon's famous recommendations system -- the system that recommends books, movies, and other products to Amazon customers ...
Here Be Dragons (1)
share
digg
by
The Bad Astronomer (50)
on
Bad Astronomy (304)
4 months
ago
permalink
My friend, fellow skeptic, and fellow Skeptologist Brian Dunning has put together a video on how to think critically. It’s called Here Be Dragons, and it’s a pretty good primer into how to think. It’s about 40 minutes long, and free to use (with some caveats; see the site). I think this would do well in a classroom. Any teachers out there? I know it’s too late for most school sessions, but you can download ...
From Graph Drawing to Tag-Cloud drawing? (1)
share
digg
by
Daniel Lemire (16)
on
Daniel Lemire's blog (29)
4 months
ago
permalink
Tag clouds are an interesting visualization technique because, unlike bar charts, you can easily display 30 or 50 weights in a compact figure. Moreover, because they are a 2D structure, you can more easily cluster similar tags together. The Tag-Cloud Drawing problem is the optimization of the layout of the tag clouds. It is somewhat related to the Graph Drawing problem. Recently, Fujimura et al. showed how to scale tag clouds further… up to 5,000 ...
rdf:about is a concise collection of RDF resources (1)
share
digg
4 months
ago
permalink
Joshua Tauberer, a Upenn Linguistics graduate student, maintains rdf:about as a resouce of information on the semantic web language RDF. Its a consise collection of information that manages not to overwhelm and includes good Quick Intro and RDF in Depth pages. (spotted on SWIG Scratchpad)
-
Jason Adams said:
Very useful if you're working with OpenCalais at all.
How George W. Bush is changing my life for the better (1)
share
digg
by
scott (393)
on
Shtetl-Optimized (16)
4 months, 2 weeks
ago
permalink
I’ll give you a hint: it’s not from the rebate checks. Let me put it this way: from now on, I am going to exercise at least twice a week. I’m going to post the remaining Quantum Computing Since Democritus lectures. I’m going to finish writing up several papers that I’ve been procrastinating on for years. I’m going to get involved in more non-work-or-blog-related social activities. And I’m going to do all of these things ...
Three levels of addressing the Netflix Prize (1)
share
digg
by
YehudaKoren (0)
on
Machine Learning (Theory) (10)
4 months, 3 weeks
ago
permalink
In October 2006, the online movie renter, Netflix, announced the Netflix Prize contest. They published a comprehensive dataset including more than 100 million movie ratings, which were performed by about 480,000 real customers on 17,770 movies. Competitors in the challenge are required to estimate a few million ratings. To win the “grand prize,” they need to deliver a 10% improvement in the prediction error compared with the results of Cinematch, Netflix’s proprietary recommender system. Best ...
Clinton resorts to anti-intellectualism (1)
share
digg
by
Ian (81)
on
Ian's blog (0)
5 months, 1 week
ago
permalink
Clinton’s proposal to alleviate the hardship imposed by high gas prices by not taxing gas is a joke, its basically a gift to the oil companies from taxpayers (and lets face it, they’ve been having a really hard time lately, haven’t they?). The price of gas is limited only by how much people are willing to pay for it. If less of what they pay goes to the government, the beneficiary will be the oil ...