Thursday, October 10, 2013
Opening up Course Builder data
Course Builder is an experimental, open source platform for delivering massive online open courses. When you run Course Builder, you own everything from the production instance to the student data that builds up while your course is running.
Part of being open is making it easy for you to access and work with your data. Earlier this year we shipped a tool called ETL (short for extract-transform-load) that you can use to pull your data out of Course Builder, run arbitrary computations on it, and load it back. We wrote a post that goes into detail on how you can use ETL to get copies of your data in an open, easy-to-read format, as well as write custom jobs for processing that data offline.
Now we’ve taken the next step and added richer data processing tools to ETL. With them, you can build data processing pipelines that analyze large datasets with MapReduce. Inside Google we’ve used these tools to learn from the courses we’ve run. We provide example pipelines ranging from the simple to the complex, along with formatters to convert your data into open formats (CSV, JSON, plain text, and XML) that play nice with third-party data analysis tools.
We hope that adding robust data processing features to Course Builder will not only provide direct utility to organizations that need to process data to meet their internal business goals, but also make it easier for educators and researchers to gauge the efficacy of the massive online open courses run on the Course Builder platform.
Wednesday, October 9, 2013
50 million lines of code and counting: supporting students in open source
To date, the program has produced 50 million lines of open source code from more than 8,500 student developers—and in 2014, we'll mark the 10th anniversary of Google Summer of Code.
To celebrate the previous nine years of student contributions and set the stage for the best Google Summer of Code yet, we’re launching 10 things to make the program better than ever. Here’s a peek at what we’ll be up to, stay tuned to the Open Source blog for updates:
- We’re planning 10 visits to countries with the highest participation throughout the year to promote the program and celebrate local students and mentors.
- A 10 percent increase in the student stipend, bringing the amount to $5,500.
- We’re also accepting 10 percent more students than ever before—more than 1,300 students will spend their summer coding as part of the program next year
- A 10-year reunion mentor summit held at Google’s Mountain View campus for our Google Summer of Code organization alumni.
We’re excited to be running a program that touches a lot of lives around the world every year, and we want to celebrate all of the accomplishments we’ve seen from our participants.
We’re also committed to getting younger students involved in open source software. For the fourth consecutive year, in November we’ll run Google Code-in, an international contest designed to introduce 13-17 year old students to the world of open source development. You don’t have to be a coder to get involved in this contest; there are a variety of ways students can contribute to open source projects.
Each year, open source software is becoming more important to governments and industries such as healthcare, gaming and technology. We believe that investing in youth and open source will improve both technology and society. Applications for Google Summer of Code open in March and students interested in Google Code-in can apply starting November 18. See our Open Source blog post for more details on both programs. Here’s to the next year in open source!
Posted by Carol Smith, Open Source Programs Office
Tuesday, April 23, 2013
A new kind of summer job: open source coding with Google Summer of Code
Over the past eight years more than 6,000 students have “graduated” from this global program, working with almost 400 different open source projects. Students who are accepted into the program will put the skills they have learned in university to good use by working on an actual software project over the summer. Students are paired with mentors to help address technical questions and concerns throughout the course of the project. With the knowledge and hands-on experience students gain during the summer they strengthen their future employment opportunities in fields related to their academic pursuits. Best of all, more source code is created and released for the use and benefit of all.
Interested students can submit proposals on the website starting now through Friday, May 3 at 12:00pm PDT. Get started by reviewing the ideas pages of the 177 open source projects in this year’s program, and decide which projects you’re interested in. Because Google Summer of Code has a limited number of spots for students, writing a great project proposal is essential to being selected to the program—be sure to check out the Student Manual for advice.
For ongoing information throughout the application period and beyond, see the Google Open Source blog, join our Summer of Code mailing lists or join us on Internet relay chat at #gsoc on Freenode.
Good luck to all the open source coders out there, and remember to submit your proposals early—you only have until May 3 to apply!
Posted by Stephanie Taylor, Open Source team
Tuesday, February 26, 2013
Applauding the White House Memorandum on Open Access
Last week the Obama Administration issued a Memorandum that could vastly increase the impact of federally funded research on innovation and the economy. Entrepreneurs, businesses, students, patients, researchers, and the public will soon have digital access to the wealth of research publications and data funded by Federal agencies. We're excited that this important work will be made more broadly accessible.
This memorandum directs federal agencies with annual research and development budgets of $100 million or more to open up access to the crucial results of publicly funded research (including both unclassified articles and data). These agencies will need to provide the public with free and unlimited online access to the results of that research after a guideline 12 month embargo period. Before last week only one agency, the National Institutes of Health, had a public research access policy.
The federal government funds tens of billions of dollars in research each year through agencies like the National Science Foundation, National Institutes of Health, and the Department of Energy. These investments are intended to advance science, accelerate innovation, grow our economy, and improve the lives of all Americans and members of the public. Opening this research up to the public will accelerate these goals.
Federal investment in research and development only pays off if it has an impact. Researchers, businesses, policymakers, entrepreneurs, and the public need to be able to access and use the knowledge contained in the articles and data generated by those funds. Making the results of scholarly research accessible and reusable in digital form is one important way to increase the impact of existing taxpayer investments.
Wednesday, November 21, 2012
Discover the world of open source with Google Code-in 2012
Starting Monday, November 26 and for the following 50 days, contest participants will work on fun online tasks for 10 different open source organizations. Possible challenges include documentation, marketing outreach, software coding, user experience research and more.
Participants earn points for each task they successfully complete and can earn prizes like T-shirts and certificates of completion. This year we’re doubling the number of grand prize winners to 20 talented students, who will win a trip to Google’s headquarters in Mountain View, Calif. They’ll spend a day getting a tour of the “Googleplex,” meeting and talking with Google engineers, plus enjoy another full day exploring San Francisco and other surprises.
Last year, 542 students from 56 countries and 360 schools completed 3,054 tasks during the eight-week contest. This year we want to encourage even more students to participate in the contest and learn about open source development. If you’d like to sign up, please review our Frequently Asked Questions and the contest rules on our program site. You can also watch our screencast, check out some sample tasks from last year’s contest and join our discussion list for any other questions. For details on important dates for the contest, see the timeline. You can register for your account on the program site when the contest opens on Monday, November 26 at 9:00am PST.
Finally, our Open Source Programs team will be hosting a Hangout on Air on the Google in Education page November 26 at 2:00pm PST to discuss the details of the Google Code-in contest and answer any questions.
We hope you’ll spend your winter (or summer, for our friends in the southern hemisphere) learning about the ins and outs of open source development through hands-on experience. Ready...set...
Posted by Stephanie Taylor, Open Source Programs
Friday, October 5, 2012
ReFr: A New Open-Source Framework for Building Reranking Models
We are pleased to announce the release of an open source, general-purpose framework designed for reranking problems, ReFr (Reranker Framework), now available at: http://code.google.com/p/refr/.
Many types of systems capable of processing speech and human language text produce multiple hypothesized outputs for a given input, each with a score. In the case of machine translation systems, these hypotheses correspond to possible translations from some sentence in a source language to a target language. In the case of speech recognition, the hypotheses are possible word sequences of what was said derived from the input audio. The goal of such systems is usually to produce a single output for a given input, and so they almost always just pick the highest-scoring hypothesis.
A reranker is a system that uses a trained model to rerank these scored hypotheses, possibly inducing a different ranked order. The goal is that by employing a second model after the fact, one can make use of additional information not available to the original model, and produce better overall results. This approach has been shown to be useful for a wide variety of speech and natural language processing problems, and was the subject of one of the groups at the 2011 summer workshop at Johns Hopkins’ Center for Language and Speech Processing. At that workshop, led by Professor Brian Roark of Oregon Health & Science University, we began building a general-purpose framework for training and using reranking models. The result of all this work is ReFr.
From the outset, we designed ReFr with both speed and flexibility in mind. The core implementation is entirely in C++, with a flexible architecture allowing rich experimentation with both features and learning methods. The framework also employs a powerful runtime configuration mechanism to make experimentation even easier. Finally, ReFr leverages the parallel processing power of Hadoop to train and use large-scale reranking models in a distributed computing environment.
Tuesday, March 27, 2012
A new kind of summer job: open source coding with Google Summer of Code
Google Summer of Code is a global program that gives university students a stipend to write code for open source projects over a three month period. Accepted students are paired with a mentor from the participating projects, gaining exposure to real-world software development and the opportunity for future employment in areas related to their academic pursuits. Best of all, more source code is created and released for the use and benefit of all.
For the past ten days, interested students have had the opportunity to review the ideas pages for this year’s 180 accepted projects and research which projects they would like to contribute to this year. We hope all interested students will apply! Submit your proposal to the mentoring organizations via the Google Summer of Code program website from today through Friday, April 6 at 19:00 UTC.
Google Summer of Code is a highly competitive program with a limited number of spots. Students should consult the Google Summer of Code student manual for suggestions on how to write a quality proposal that will grab the attention of the mentoring organizations. Multiple proposals are allowed but we highly recommend focusing on quality over quantity. The mentoring organizations have many proposals to review, so it is important to follow each organization’s specific guidelines or templates and we advise you to submit your proposal early so you can receive timely feedback.
For more tips, see a list of some helpful dos and don’ts for successful student participation written by a group of experienced Google Summer of Code administrators, our user’s guide for the program site, Frequently Asked Questions and timeline. You can also stay up-to-date on all things Google Summer of Code on our Google Open Source blog, mailing lists or on Internet relay chat at #gsoc on Freenode.
To learn more about Google Summer of Code, tune in to the Google Students page on Google+ next Monday, April 2 at 3:30pm PT for a Hangout on Air with open source programs manager Chris DiBona. He'll be talking about Google Summer of Code with other members of the open source team at Google. Submit your questions about the program between now and next Monday using the hashtag #gsochangout, and Chris and the open source team will answer them live during the Hangout On Air.
Good luck to all the open source coders out there, and remember to submit your proposals early—you only have until April 6!
Posted by Carol Smith, Open Source Team
Tuesday, March 6, 2012
Keeping an “OER mind” about shared resources for education
With ever-increasing demands being placed on our education system, including new skill sets that need to be taught to create a pipeline that can fill 21st century jobs, we must figure out how to make high-quality education more accessible to more people without overburdening our existing educational institutions. The Internet, and the platforms, tools and programs it enables, will surely be a part of the answer to this challenge.
Open Educational Resources (OER) are one piece of the solution. OER are teaching and learning resources that anyone can share, reuse and remix. As part of our ongoing commitment to increasing access to a cost-effective, high-quality education, we’re supporting the OpenCourseWare Consortium — a collaboration of higher education institutions and associated organizations from around the world creating OER — in organizing Open Education Week 2012, which begins today.
An example of OER in action is OpenStax, a recent non-profit initiative of Rice University and Connexions to offer students free, professional quality textbooks that meet scope and sequence requirements for several courses. They believe that these books could save students over $90 million in the next five years. Non-profit isn’t the only model for open education. Flat World Knowledge has built a business around OER by providing free online access to open textbooks, then selling print-on-demand copies and supplemental materials.
We’ll be acknowledging OER week through a panel event in Washington, DC, and over on our +Google in Education page, where we’ll be posting articles, sharing stories and interviews about the benefits of open education resources. Opening these resources to everyone can improve the quality of education while getting more out of our investments in educational resources. We hope you’ll join us in celebrating Open Education Week. Go to openeducationweek.org to learn more and get involved.
Monday, January 30, 2012
Data and code open sourced from Google's Renewable Energy Cheaper than Coal project
Cross-posted with the Open Source at Google Blog
Google’s RE<C renewable energy research project has recently open sourced a new tool and a significant amount of data to support future CSP (concentrating solar power) heliostat development.
HOpS Open Source Site
HOpS, heliostat optical simulation, is an open source software tool for accurately and efficiently performing optical simulations of fields of heliostats, the actuated mirror assemblies that direct sunlight onto a target in CSP applications.
Google used this tool to help evaluate heliostat field layouts and calculate heat input into a CSP receiver for power production. HOpS works by passing "packets" of light between optical elements (the sun, heliostats, and elements of the target surface), tracking shadowing and blocking masks along the way. For our analysis goals, this approach gave our researchers more flexibility and accuracy than analytic tools (such as DELSOL or HFLCAL), and it was easier to set up for thousands of runs than using ray tracers. Output from the simulation includes heliostat efficiency, target irradiance, and more, while an included shell script facilitates plotting heat maps of the output data using gnuplot.
REC-CSP Open Source Site
The REC_CSP open source project contains data sets and software useful for designing cheaper heliostats.
Available on the project site are:
1. Thirty days of three-dimensional wind measurement data taken with ultrasonic anemometers (sampled at ~7 Hz), recorded at several near surface elevations. The data is presented in the RE<C wind data collection document and is available for download on the open source site here.
2. A collection of heliostat aerodynamic load data obtained in a NASA wind tunnel and graphically represented in the appendix. This data is available for download on the open source site here.
3. Matlab software for high-precision, on-target heliostat control with built-in simulation for testing. This is essentially the same software used in the RE<C heliostat control demonstrations and described in the accelerometer sensing and control system design documents. The source code is available for download here.
Friday, November 18, 2011
Explore open source with the Google Code-in contest
Participants earn points for each task they successfully complete and can earn prizes like t-shirts, cash and certificates of completion. The ten participants with the highest points earned by the end of the competition receive a grand prize trip to Google headquarters in Mountain View, Calif. next spring for themselves and a parent or legal guardian. They’ll spend the day getting a tour of campus, meeting Google engineers and enjoying other fun surprises.
Last year’s Google Code-in had 361 students from 48 countries completing 2,167 tasks over the course of the the eight-week contest. We hope to have even more students participate this year. Help us spread the word by telling your friends, classmates, children, colleagues, teachers—everyone!
If you’d like to sign up, please review our Frequently Asked Questions and the contest rules on our program site. You can also join our discussion list for any other questions. For details on important dates for the contest, see the timeline. You can go ahead and register for your account now on the program site so you will be able to start claiming tasks right away when the contest opens on Monday, November 21 at 12:00am (midnight) PST.
We hope you’ll spend your winter (or summer, for our friends in the southern hemisphere) learning about the ins and outs of open-source development through hands-on experience. On your marks...
Posted by Stephanie Taylor, Open Source Programs