MediaCloud: Charting News Trends Online

By David Weir | Mar 12, 2009

During this unfolding drama of the death of the media industry as we know it, we face a constant choice: look backward and chronicle the collapse of one sector after another, or look forward and document the birth of a new media system right before our eyes.

Now that my colleagues in traditional media are taking up the first challenge, as witnessed by today’s excellent roundup in The New York Times by Richard Pérez-Peña headlined “As Cities Go From Two Papers to One, Talk of Zero,” I am going to try to focus on the less well-charted territory of our common media future.

For help with that, today let’s turn to Harvard’s Berkman Center for Internet & Society, which has just launched an intriguing new experiment called MediaCloud. In an interview with Joshua Benton of the Nieman Journalism Lab, Berkman Fellow Ethan Zuckerman describes how MediaCloud works:

“It is a very large set of data, as well as some simple tools for playing with it, obtained by subscribing to and processing hundreds of American blogs and a couple hundred newspapers in English from around the world.

“The idea behind this is that we subscribe to the RSS feeds of these newspapers and these blogs. We grab every single story that they publish. We then pull the story text out of the HTML, which is an interesting hack. We throw the story text into a bunch of different tools that help us determine what the stories are about. So we’re able to get topic information. We’re able to get information on people mentioned in the stories — what’s called named entities.

“And then we file this all off in a database. So if you then want to find out what the stories were on Fox News for a given week, we can tell you what their top-10 topics were. We can also go levels further and say: When a news source reported on a topic, what other topics were most closely associated with it?”
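For readers who want to see what the last step of that description might look like, here is a minimal sketch in Python. It is not MediaCloud's code: the sample stories, source names, and the function names `top_topics` and `associated_topics` are all made up for illustration, and it assumes each story has already been tagged with a set of topics.

```python
from collections import Counter

# Hypothetical, hand-made sample: (source, topics tagged on one story).
# The real MediaCloud data and schema are not shown in the interview.
STORIES = [
    ("Source A", {"Google", "Microsoft", "Yahoo"}),
    ("Source A", {"Google", "Microsoft"}),
    ("Source A", {"Google", "eBay"}),
    ("Source B", {"Google", "Obama"}),
    ("Source B", {"Obama", "Economy"}),
]


def top_topics(stories, source, n=10):
    """Return the n most frequently tagged topics for one source."""
    counts = Counter()
    for story_source, topics in stories:
        if story_source == source:
            counts.update(topics)
    return counts.most_common(n)


def associated_topics(stories, source, topic, n=10):
    """Return the topics that most often share a story with `topic`."""
    counts = Counter()
    for story_source, topics in stories:
        if story_source == source and topic in topics:
            # Count every other topic tagged on the same story.
            counts.update(topics - {topic})
    return counts.most_common(n)


if __name__ == "__main__":
    print(top_topics(STORIES, "Source A", n=2))
    print(associated_topics(STORIES, "Source A", "Google", n=1))
```

Counting how often topics appear together on the same story is only one simple way to measure "closely associated"; the interview does not say which measure MediaCloud itself uses.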

Playing around with MediaCloud this morning, I can see that it is very much a work in progress, but that its upside is tremendous. Comparing the top ten terms associated with the keyword Google in The New York Times, the San Francisco Chronicle, and Pajamas Media (a blogging site where I occasionally post media items), I obtained the following results:

  • The Times most frequently mentioned Microsoft, eBay, Yahoo, and a host of computer and electronics terms (as well as porn) in its coverage of Google, suggesting a definite business focus.
  • The Chronicle most frequently mentioned places (as well as one person), namely U.S., San Francisco, Washington, California, the White House, and Barack Obama, suggesting more of a political and geographical focus in its Google coverage, although Yahoo, Microsoft and YouTube did make the list.
  • Pajamas had an almost entirely different list dominated by the topics its bloggers obsess about (Iraq, Hamas, Israel, Gaza, food, Republican Party, Obama), indicating Google’s main function here was as a search tool.

Mine is quite a primitive analysis, but you can see some of the potential comparative value for professors, students, and analysts following the media industry, old and new alike. What’s perhaps less obvious is the opportunity for new media businesses.

As a new media company emerges, part of its winning strategy will have to be to define those niches in media coverage where it might expand its audience most rapidly. MediaCloud may emerge as the kind of data analysis tool that helps entrepreneurs do just that. There are others — Google’s News Trends for one — but this is still underdeveloped territory, worth monitoring closely as it develops.

As my BNET colleague Erik Sherman notes in his post on MediaCloud today: “…the real value the cloud is going to offer is the ability to combine information from different sources and gain new levels of understanding in such areas as strategic planning, operations, or market analysis.”

Thanks to Tamara Baltar and Erik Sherman for pointing me to today’s topics.

In addition to serving as a BNET Media analyst/blogger, David Weir is a veteran journalist and the author of several books. Weir is a co-founder and vice-president of the Center for Investigative Reporting, as well as an editorial board member of The Nation.
