Data Journalism in Norway

I spent some time in Bergen last week (lovely place, bloody expensive beer!) to talk to some people about a new content management system.  Whilst I was there I dropped in on a seminar about data

…dedicated to the emerging “web of data” and how it could create new possibilities in a deeply disrupted media economy.

The shorthand for this was ‘breaking out of the silos’. To underline that point, some of the organizers  were running around in municipal-workers jackets. That was a bit lost on me other than thinking Norwegian workers are pretty snappy dressers!

It turned out to be a really interesting, mixed bag of people who were fired up by the possibilities of linking open data (LOD).

Pia J.V. Josendal opened the batting with a neat presentation that was kind of a dummies guide to data. A few interesting things in there for me like finding out what a triple is and also the five star rating system for your data.

The next delegate was Hjalmar Gislason from DataMarket.com, a nifty website that collects data (time series at the moment) and lets you visualize it like this.

I was struck by what a cool name they had and pondered that it shows just how recent the mainstream interest in this stuff is that you could get a name like that. Hjálmar Gíslason agreed.

@ Yup, when we secured the domain in 2009, the term "data market" had hrdly been coined. Imgine?!
@datamarket
DataMarket

His presentation was quite nifty too.

One presentation I couldn’t stay for but looked really interesting was Rune Smistad’s run through rNews (a proposed standard for using RDFa to annotate news-specific metadata in HTML documents) The slides are interesting but I think I missed out on not hearing the context.

There was a heavy presence of journalists but they were by no means the majority, it wasn’t a data journalism conference. But it was clear that everyone thought that journalism was the place that the concept was getting most traction and most use.

The UK got a lot of love for it’s data-J work during the sessions but I saw a lot of similarities in the approaches. It also showed me that there are a lot of tech people, people who understand all this triples, sparq and data stuff. They can see the use for it and they have a passion for getting it out there. It doesn’t matter that  they are in Norway (or the UK for that matter) they just want journalists to come and do good stuff with the data they are freeing from the silos.

Guardian local:Failed experiment?

Lets get this hyperlocal stuff back to London. Just don't get any on your hands (via Wikipedia)

In short. No and credit for trying.

I was frustrated by much of the twitter chat around the announcement that Guardian local was no more. The insinuation seemed to be that this was a failed experiment. I tweeted that no experiment is a failure.

So it’s good to hear that the Guardian:

have also learned from the local communities who got involved with telling their stories. And using this we have continually refined our approach over the past year.

But it’s scary to hear

One of the guiding principles of the local blogs has been dialogue with communities about situations and topics of mutual interest and concern. There will still be plenty of that on guardian.co.uk – for example, in our growing army of local cutswatchers, monitoring local council activities – but we felt, in that spirit, that we should share the thinking behind the local experiment with you, the readers who have been involved all along

‘We value the community so much that we want to bring it all in to one place’. It’s trying to take community out of the community.’ Thanks for allowing us to experiment on you but we’re back off to London now. We’ll call you when we need you!’

But I’m trying not to be parochial about this and dismiss it as the Guardian cementing the London centric nature of their broader community offering. (Who wants to be a member of the ‘Guardian club’ (the touchy feely response to the paywall argument) if there is no room for local communities? But, hey, I did see a masterclass in Manchester.)

No,  I’m sure that the Guardian has learned loads and will see the benefit. I’m sure they understand how to run a crowd now. I’m sure they see the value in having someone on the ground. They must see the potential of new technology in having faster, targeted and responsive journalism. It even strengthened their brand – albeit in a passive way.

So a lot for the Guardian to be proud of. But any the failure of any experiment comes from how you use the results not the experiment itself. And they’ll fail if they take the results and don’t keep the hyperlocal team.

Relationships matter

Talking to community managers from the small hyperlocals to the big players like Propublica it’s clear that there is real value in the experience of handling a crowd at grass roots. I’m sure that Hannah, John and Michael have that in spades. I never met John and Michael but did meet Hannah. She is whip smart. I’ve no reason to assume that John and Michael are any different because Sarah, who drove the project, is planet-size smart when it comes to this stuff. To lose them would be a bad fail.

The truth is that the value of the Guardian local communities rests with them; their work and their relationship building. The unique nature of each area can’t be homogenised in to a broad model. The people who are upset to see the sites go didn’t have a relationship with the Guardian – the Guardian is the bastard that broke their relationship up!  You can’t just transplant the Guardian Cardiff model anywhere. You could put Hannah or John or Michael anywhere and they’d use that experience. But you might also lose some of their passion and, with the best will in the world, there would be little or no reason for their Guardian Local audiences to follow them.

That’s why I stand by my belief that hyperlocal is not a model that large media organisations can ever get right.

I wanted the Guardian to prove me wrong and for a while they did. They let the hyperlocals have an identity and quietly absorbed the experience.Yes!  Then they went and blew it by acting like the Guardian rather than letting the sites speak for themselves and standing by their belief “that journalism plays a vital role in communities”

Update: A great storify from Sarah on the closure – the tweets alone show what people feel about the move.

More updates: Rick Waghorn, whose Addiply system was used for ads on the local sites, is very nice in saying I was one of the people who ’get’ the biggest lesson to learn from the saga. Shucks!  Tom Allen has a good post about the response to the closure including the start of fundraising efforts to try to take the guardian up on their offer of partnership. I was surprised that the Guardian only considered this ‘after’ the closure announcement rather than seeing it as part of the exit strategy. But the support is showing and efforts to find ways to fund the sits are underway. Matt Edgar suggests using Guardian subscriptions to pay - don’t subscribe to the Guardian any more, subscribe to hyperlocal. I’m sure the loss of subscriptions was not in the Guardians mind when they closed the site.

 The Media briefing also picks up on the this thread as Ed Oldfied looks at how the story developed and what happens next. There are some interesting facts and figures in the post but it does rest of the word failed again! A point that is picked up in an addition to the post:

It is worth pointing out that while the business model of these sites was unproven and ultimately unviable, the publishing model from a content perspective was a success – as proven by the outpouring of anger from readers in Leeds, Edinburgh and Cardiff, and the awards and accolades the three beatbloggers gathered.

Sarah Hartley, who led the project, doesn’t agree with Ed’s analysis and “takes exception to the term ‘failed’”, preferring to describe the project as “halted, stopped, concluded”.

I agree.

MCEngine: New WordPress plugin

A few years ago I wrote a little plugin called Feedback by Paragraph (feedbackBP). It added a little link at the end of each paragraph in a post which, when clicked, popped up a comment box so you could comment on parts of a post not just the end. Why?

This plugin was written to help me feedback comments on blog posts to my journalism students who use wordpress as a base for their online publications.

It was based on an idea I saw at http://newsmixer.us which allows users to comment or ask questions on a particular paragraph. The creators of newsmixer are looking to turn the thing in to an API with a wordpress plugin which would be cool but seems a little way off.

The closest plugin I could find is one called marginalia (http://marginalia.cc/) which looks very nice but doesn’t seem to play well with WordPress 2.8. So I’ve written Feedback by Paragraph to fill the gap.

One of the great things about developing a plugin is that you get the occasional reference in your logs as people install them. I’ve seen a number of interesting sites as a result. But the best feedback comes from comments and from trackbacks for the occasional mention the plugin gets.

I had a bit of free time and thought it was time to dust of feedbackBP and see if I could tweak it. One thing lead to another and the result…MCEngine.

It’s a bit of leap forward (in functionality if not in my coding skills) but the beta is now available if you want to give it a go.

Let me know what you think.

Flyposting newspaper websites

Aaaaaaaarrrrrrggggggh!

Imagine the scene. I’m on the bus. I’ve found a seat that isn’t near the bloke who shouts at cars and smells vaguely of rabbits. My headphones are in (but not too loud,of course).

I take out my copy of the Birmingham Post and open it up. Scanning around the page I see an article that catches my eye. But just before I start reading…the person sat behind me pulls out a pot of wallpaper paste and slathers a great billboard poster across the top of the page.

It turns out that in scanning around I inadvertently caught the eye of an advert nestling in the corner of the page.

Sound plausable? No I didn’t think so.

So please stop doing it on your bloody websites newspaper people.

That is all.

ScraperWiki: Hacks and Hackers day, Manchester.

If you’re not familiar with scraperwiki it’s ”all the tools you need for Screen Scraping, Data Mining & visualisation”.

These guys are working really hard at convincing Journos that data is their friend by staging a steady stream of events bringing together journos and programmers together to see what happens.

So I landed at NWVM’s offices to what seems like a mountain of laptops, fried food, coke and biscuits to be one of the judges of their latest hacks and hackers day in Manchester (#hhhmcr). I was expecting some interesting stuff. I wasn’t dissapointed.

The winners

We had to pick three prizes from the six of so projects started that day and here’s what we (Tom Dobson, Julian Tait and me)  ended up with.

The three winners, in reverse order:

Quarternote: A website that would ‘scrape’ myspace for band information. The idea was that you could put a location and style of music in to the system and it would compile a line-up of bands.

A great idea (although more hacker than hack) and if I was a dragon I would consider investing. These guys also won the Scraperwiki ‘cup’ award for actually being brave enough to have a go at scraping data from Myspace. Apparently myspace content has less structure than custard! The collective gasps from the geeks in the room when they said that was what they wanted to do underlined that.

Second was Preston’s summer of spend.  Local councils are supposed to make details of any invoice over 500 pounds available, and many have. But many don’t make the data very useable.  Preston City council is no exception. PDF’s!

With a little help from Scraperwiki the data was scraped, tidied and put in a spreadsheet and then organised. It through up some fun stuff – 1000 pounds to The Bikini Beach Band! And some really interesting areas for exploration – like a single payment of over 80,000 to one person (why?) – and I’m sure we’ll see more from this as the data gets a good running through.  A really good example of how a journo and a hacker can work together.

The winner was one of number of projects that took the tweets from the GMP 24hr tweet experiment; what one group titled ‘Genetically modified police’ tweeting :). Enrico Zini and Yuwei Lin built a searchable GMP24 tweet database (and a great write up of the process) of the tweets which allowed searching by location, keyword, all kinds of things. It was a great use of the data and the working prototype was impressive given the time they had.

Credit should go to Michael Brunton-Spall of the Guardian into a useable dataset which saved a lot of work for those groups using the tweets as the raw data for their projects.

Other projects included mapping deprivation in manchester and a legal website that if it comes off will really be one to watch. All brilliant stuff.

Hacks and hackers we need you

Give the increasing amount of raw data that organisations are pumping out journalists will find themselves vital in making sure that they stay accountable. But I said in an earlier post that good journalists don’t need to know how to do everything, they just need to know who to ask.

The day proved to me and, I think to lots of people there,  that asking a hacker to help sort data out is really worth it.

I’m sure there will be more blogs etc about the day appearing over the next few days.

Thanks to everyone concerned for asking me along.

Hacks and Hackers hack day Manchester

Any sufficiently complicated regular expression is indistinguishable from magic

A bit of a nod to Arthur C.Clarke there but something that hits home every time I do any hacking around under the bonnet of the interwebs.

When it comes to this data journalism malarky some might say (to steal another movie quote) a mans got to know his limitations. But I firmly believe a good journalist, when stuck, knows who to ask. I’m very excited that more and more journos are realising that there are no end of tools and motivated people who can be part of the storytelling process.

So I was delighted to be asked to be one of the judges for ScraperWiki’s hacks and hackers hack day in Manchester tomorrow and see that in action.

The event just one of a number of similar days around the UK.  The successes in Birmingham and Liverpool amongst others, mean that tomorrow should be fun.

If your going, see you there (later on). If not I’ll tweet etc (hashtag:#hhhmcr). as I can.

Editorial and commercial: Part of a journalists job description

John Slattery picked up on a job ad at the MEN for two community reporters. Great stuff. But commenting on the job description, he points out:

In a sign of the times, the ad also says: “The ability to identify editorial and commercial opportunities is key” as well as an excellent knowledge “of contemporary social media and a solid understanding of multimedia gathering”.

I wish I had that with me yesterday when I talked to third-year students about convergence. I talked about how convergence contributed to the problems paying for journalism (both consumer and provider).

I mentioned how this issue was not a rarified one, distant from the journalistic process.  Its going to have a very real impact, especially as hyperlocal grows. And, of course process,will have to change to accept that.

To illustrate that point I used a quote from ‘godfather of hyperlocal’ Rick Waghorn talking to The Independent about the nervousness of journalists when it comes to ‘things commercial’

They really don’t like the idea of knocking on the door and asking for an advert. Fascinating that those same journalists will knock on a door after a teenage boy is killed in a road accident. They see that as part of their journalistic DNA. Ask that same journalist to knock on the door and ask for a ten pound a week advert and its ‘that’s not my job’.  I think it will be their job on a level. Certainly on that local level anyway. We have to master new skills and from mastering new skills there will come a demand for new tools.

I pithily commented that in the future would have to do a death knock and add that for 10 quid you’d could do a really nice job on a obituary.

That’s a step too far, I know. But maybe the job ad goes some way to proving both of us right (and what many of us already know) the economics of news is everyones business, especially  journalists.

Digital Journalism: Ethics and ethos

Twitter through up an interesting link to NYU’s  Journalism Handbook for Students: Ethics, Law and Good Practice. I was particually taken with their Ethics pledge which all students are expected to sign or “The final grade for a student registered in a journalism course will not be submitted to the Registrar”.

It begins with:

As a New York University journalism student, you are part of a community of scholars at a university recognized for its research. A scholar’s mission is to push forward the boundaries of knowledge; a journalist’s mission is to serve the public by seeking out and reporting the facts as accurately as possible. Good journalists and scholars share a commitment to the same principle: integrity in their work.

By signing this ethics pledge, you agree to maintain the highest standards of honesty and foster ethical behavior at all times. Anyone who fails to uphold these ethical standards has committed a serious violation of this agreement. Penalties can range from an F on an assignment to a failing grade in a course to expulsion, depending on the decision of the instructor in consultation with the Institute’s Ethics Committee.

Serious stuff.  The idea that an ethics comittee within an institution would consider, and rule upon,  proffessional ethics outside of the purley academic is challenging but, I think, right. Behaviour like Plagiarism is cited as the kind of behaviour that breaks the pledge and could get you hauled up.

Now we take plagiarism serioulsy but it’s an academic issue, there are serious punishments, but academic none the less. The ethics comittee oversees research activity. We also hammer home the Society of Editors code of conduct etc.  But I’d love it to be more directly asssociated with the professional ethics of journalism – more proffession based.

Defining a digital journalist.

The pledge chimed with me as I’m updating my Digital newsroom class for this year. The class handbook includes a page that outlines the ‘module ethic’:

This module is not about defining a digital newsroom.

This module looks at the way digital and online practice affects newsrooms
and how that, in turn, changes and develops individual journalism practice.

We will explore this by :

  • Looking at the context in which digital and online practice has
    developed and how that has changed newsroom practice
  • Looking at the tools used and evaluating how they can be used to
    create content.

You will use one to inform the other in a way that suits your practice.
As you do this module there are two things to keep in mind.

  • We are platform agnostics: You can be a newspaper, radio,
    magazine, TV or online journalist and still be digital
  • We are consumers and providers: Think about what it takes to
    produce the content you use everyday.

But most of all, remember: You are a digital journalist!

Whatever their motivation for getting in to journalism, whichever media they see themselves working in, understanding how digital tools and practice can fit in to their practice is what being a digital journalist is all about. That last bit is a given whether they like it or not.

I can’t get students to sign-up to it and if they ignore it there is no ‘ethos panel’ but at least we start from a common ground.

Image credit: WCN247 on flickr

Enhanced by Zemanta