Please Note: This project is not currently active. The content on this site is provided for reference and is not actively maintained.

Archive for February, 2012

Training the Cloud with the Crowd: Training A Google Prediction API Model Using CrowdFlower’s Workforce

by February 29, 2012



Can a machine be taught to determine the sentiment of a Twitter message about weather?  With the data from over 1 million crowd sourced human judgements the goal was to use this data to train a predictive model and use this machine learning system to make judgements.  Below are the highlights from the research and development of a machine learning model in the cloud that predicts the sentiment of text regarding the weather.  The following are the major technologies used in this research:  Google Prediction APICrowdFlowerTwitter,  Google Maps.

The only person that can really determine the true sentiment of a tweet is the person who wrote it.  When the human crowd worker makes tweet sentiment judgements only 44% of the time do all 5 humans make the same judgement.  CrowdFlower’s crowd sourcing processes are great for managing the art and science of sentiment analysis.  You can scale up CrowdFlower’s number of crowd workers per record to increase accuracy, of course at a scaled up cost.

The results of this study show that when all 5 crowd workers agree on the sentiment of tweet the predictive model makes the same judgement 90% of the time.  When you take all tweets the CrowdFlower and Predictive model return the same judgement 71% of the time.  Both CrowdFlower and Google Predictions supplement rather than substitute each other.  As shown in this study, CrowdFlower can successfully be used to build a domain/niche specific data set to train a Google Prediciton model.  I see the power of integrating machine learning into  crowd sourcing systems like CrowdFlower.  CrowdFlower users could have the option of automatically training a predictive model as the crowd workers make their judgements.  CrowdFlower could continually monitor the models trending accuracy and then progressively include machine workers into the worker pool.  Once the model hit X accuracy you could have a majority of data stream routed to predictive judgments while continuing to feed a small percentage of data the crowd to refresh current topics and continually validate accuracy.  MTurk hits may only be pennies but Google Prediction ‘hits’ cost even less.


More »

How Do Greenhouse Gases Trap Heat in the Atmosphere?

by February 26, 2012

If you would like to see a high-quality infographic developed for this question, please share it with your network using the sharing icons above—these actions will increase the question’s priority. Please look at the example answers here. Also, if you would value a high-quality answer being available on the Web, please consider a small donation ($0.99) to support our research team.

More »

What Causes Oxygen to be Used Up In Deep Waters?

by February 26, 2012

ANSWER (CURRENTLY UNDER REVIEW): Bacteria use up the oxygen in the process of breaking down organic matter, like the waste from tiny animals and dead algae. This answer is focused on coastal ocean waters, but generally applies to deep waters in estuaries, ponds, lakes, and slow-moving stretches of rivers.

Q001 how oxygen is used up in deep waters v04

More »

From Toilet to Treatment to Treatment to Tap in San Diego

by February 10, 2012

dog-drinking-from-toilet_sm2 My first reaction to hearing at the breakfast table about today’s piece in the NY Times about water reuse in San Diego was that it isn’t all that different from what we have been doing for years: discharging treated waste water into streams and rivers and then drawing out drinking water downstream, counting on bacterial decomposition, dilution, and other processes to treat further the discharged water. I was pleased to see that this point was discussed in the article.

Drinking water that recently was flushed down a neighbor’s drain is a tough concept, pardon the pun, to swallow. However, as the piece in the Times correctly points out, we are headed into times in which resources like water are likely to be scarcer. To my mind, this community is a shining example of people—who rightly had very strong feelings on an issue—being willing to accept what the science community had to offer. Jerry Sanders, the mayor of this San Diego community, put it this way: “If science is behind you and you can prove that, I think people are willing to listen.” Here, here!

Image in post from Climate Watch. Thumbnail image on home page from the NY Times article.

More »

EarthQ Core Business Model: Where Is the Sweet Spot?

by February 1, 2012

Recently, we have been working to launch a project dubbed EarthQ. The goal has been to shepherd good questions and cultivate good answers by bringing the right mix of experts to Quora, as is described in this post about bringing the carbon cycle to Quora. This work continues, but so far, my attempts to draw in a few experts to help with answers have come up dry. The Quora team thinks it is okay for those who pose a question to answer it, so that will likely be my next step. Perhaps with a solid answer to the first question, What are the compartments in which carbon is stored on Earth?, I can then start to draw in some colleagues to endorse the answer by up-voting it… (more…)

More »