Like Bitly, Twitter has a great real-time data set and very smart data scientists and engineers. But instead of relying on a primarily computational solution, Twitter treats real-time search more like a CAPTCHA problem. With this kind of messy data, lots of human brains can find meaning much faster and more accurately than lots of lines of code. So Twitter uses a real-time computation system called Storm to identify search spikes, then Mechanical Turk (Amazon’s crowdsourcing online platform for small jobs) to farm out annotating that data to human beings all over the world. The annotations basically take the spiking search term and tag it for relevance and intent. A human annotator (Twitter calls them “judges”) can tell Twitter’s systems whether searches for “Stanford” refer to a university or to its football team, or that searches for “Big Bird” aren’t primarily referencing a children’s show, but a political debate. This helps Twitter make trending topics smarter and more coherent.
But here’s the dark stroke of genius behind using huge masses of people to help sort out the meaning of Twitter searches: part of the judges’ task is also to match spiking search terms with pictures, events, and other categories that can help Twitter serve up relevant advertising. “For example, suppose our evaluators tell us that [Big Bird] is related to politics; the next time someone performs this search, we know to surface ads by @barackobama or @mittromney, not ads about Dora the Explorer.” The judges are like little focus groups that match intent with revenue.
Twitter just told us how cool its real-time search is… and how it makes its money | The Verge (via Tim M.)
It’s cheap humans all the way down.