Press "Enter" to skip to content

Month: June 2015

Week 4 of Structured Stories NYC: Thinking like “structured journalists”

At the halfway mark in our eight-week Structured Stories project on Friday, Natalie, Ishan and I decided to measure our performance.

By the numbers, we’ve made substantial progress — we’ve created 182 new events in 15 stories, all of which are now live on the Structured Stories website.

The more events and stories that we input, the more we find that our thinking about narrative stories changes. Increasingly, we notice ourselves deconstructing the news as we read it, breaking down articles into a series of finite events, and dicing those events into their primary nouns and verbs.

We’ve learned not to worry about engaging leads or colorful language. Instead, we focus on crafting clear, concise and specific events that are easily “structurable,” to use a term recently coined by David.

We are, in other words, finally beginning to think like structured journalists.

But a number of questions remain. In fact, sometimes it feels like the more progress we make, the more questions — big and small, technical and editorial — we have.

We’re helping David make improvements to the content management system. As we input events for our four main topics — policing, bail reform, housing and Uber — we’ve found more than 25 bugs. The list of unresolved editorial issues currently stands at 56 — a number indicative of how much we’ve learned, but daunting nonetheless.

One of our most persistent struggles remains translating events we intuitively understand in language to structured events.

In a traditional article, for example, it makes sense to say that airports have started ticketing Uber drivers. In a structured story, however, this statement would have to be attached to a specific event — with a specific authority, time and place.

We’ve tackled issues like these in hours of daily check-in Skype sessions with David, countless messages to David on Slack and near-constant discussion among ourselves.

David has patiently reassured us that this question-filled dialogue is not only natural, but also helpful in the long term. He’s reminded us that we’ve used language for tens of thousands of years, but that this data-driven approach to narrative is still nascent.

“Finding an alternative to using language in writing is a pretty audacious goal,” he noted. “It makes sense if it feels a little weird, a little unnatural at first.”

Comments closed

Week 3 of Structured Stories NYC: Getting the hang of it

There’s a particular anxiety that hits me whenever someone asks me to explain what I’m doing this summer.

I fumble through an answer with phrases like “news database” and “knowledge graphs” and “combinatorial explosion” only to face blank stares and quietly confused nods. In the end, I always wind up telling people to just wait and see, promising it will all be clear(er) once our work began appearing on the site.

We finally reached that point on Wednesday when Ishan, Rachel, and I started publishing our stories online.

As Ishan explained last week, our stories are made up of events—hundreds of them so far. Each requires the creation of an “event frame,” such as “[A Character] passed [A Law]” or “[A Character] published [An Object] about [A Topic].” To then make an event, we simply put information in the brackets and tag each one with a date, location, and primary sources. The final touches are the bullet points and summaries that the reader will see.

The process strips events to their core, leaving no room for color or flowery language. In David’s words, “It’s like old school reporting from the 50’s—just the facts, just ‘who/what/when/where.’”

Interestingly enough, the most challenging part was the creation of seemingly-simple event frames. Our first efforts were markedly “off,” but through lots of trial and error––and David’s infinite patience––we’ve started to get the hang of it.

Making the event frames means wrestling with that fine line between specificity and simplicity. We find ourselves debating whether “presenting a plan” requires a “communication” or “submitting a document” frame. It’s a small distinction, but it is key to the bigger issue: translating language to structure.

As we continue to add frames, events, and stories to the website, the list of “bugs” and “issues” gets longer and longer. But far from being discouraging, this document is in many ways the most valuable output of all in our experiment this summer––”the gold mine,” as David called it.

With every little question or problem we’re coming closer to understanding Structured Stories and what it could become—and closer to having an answer when people ask just what it is we’re doing this summer.

Comments closed

Week 2 of Structured Stories NYC: Nouns, verbs and learning to write again

We hit the ground running last week, eager to begin constructing structured stories on our topics.

I Googled everything I could about “NYC Housing” and was quickly overwhelmed. For some context, I stopped by City Hall for a hearing on the New York City Housing Authority’s plan to erase its deficit. Rachel researched Mayor de Blasio’s relationship with the police, and pieced together the myriad of events that soured their bond. Natalie tracked Uber’s meteoric rise and the subsequent PR nightmares that engulfed the ride-sharing company.

We hit our stride after a few days and marched through a routine: research, input an event in a structured story, repeat. It was slow work, especially if we had to verify conflicting accounts by checking primary sources.

“There’s something noble about making sure everything is correct,” said Natalie, alluding to our satisfaction once we solved something and could (finally) move on. Every day, we managed to finish dozens of “events,” the individual units that form the backbone of Structured Stories.

But, in a testament to how different this work is from traditional journalism, we are still having difficulty adapting to the unique writing requirements.

A structured story is different than “regular” writing because it’s all about breaking the news into data.

The data comes in two flavors: verbs and nouns. Verbs can be linked back to the FrameNet database, an expansive project that tracks meaning. Amazingly, the FrameNet database can be read by both humans and computers. It translates complex human meaning into data.

Nouns come from Freebase, a sprawling database owned by Google. Freebase assigns items unique identifiers, and we use these IDs to track characters or topics over time. De Blasio, for instance, is known in the database as  /m/0gjsd3.

An example of the underlying event structure that powers the Structured Stories platform.
An example of the underlying event structure that powers the Structured Stories platform.

In Structured Stories, combining verbs and nouns creates a data-rich event. And that data can be manipulated, allowing readers to see links between stories or track events over time. That’s the power of structure.

David Caswell, the creator of Structured Stories, told us our confusion was natural. A structured story in its raw verb/noun form is not meant to be read by a human. In fact, most readers won’t see the structured view when they visit the Structured Stories platform. They’ll read the bullet-points or summaries, which Rachel, Natalie and I write after we have structured an event. Bullet-points and summaries are the “normal” human sentence behind an event. Underlying that sentence, though, is a web of connections and malleable data that will provide readers with new information they have never been able to get before.

This project makes me feel like I’m learning to write again. I’m paying extra attention to nouns and verbs and stripping events to their core meaning. There seems to be a constant tug of war between language and structure when writing these events, with the ideal falling somewhere in the middle.

For now, we’re still searching for that happy medium.

Comments closed

Week 1 of Structured Stories NYC: Unlocking the atoms of news

On Wednesday, as our Structured Stories NYC team debated whether to use “text object” or “information artifact” to describe a field in our database, I realized we were in new journalistic territory.

The debate illustrated the unique approach of our summer experiment. Instead of publishing traditional news stories with headlines and text, two Duke students (Natalie Ritchie and Rachel Chason) and one recent Duke grad (Ishan Thakore) are segmenting the news into chunks. That approach has been tried before by Circa and a few others, but Structured Stories goes far deeper, structuring the basic elements of nouns and verb phrases to reveal new truths about the news.

That structure enables us to link the elements in a myriad of valuable new ways. For example, “The taxicab commission held a public hearing on new regulations about for-hire vehicles” can be linked with characters and entities such as Mayor Bill De Blasio, Uber drivers and the taxicab commission. That structure will  empower readers in many different ways. They can easily find the latest developments in a long-running story (“Uber drivers held a protest against new regulations”) and they can interact with the “events” to reveal new patterns and relationships.

Structured Stories Team
The Structured Stories NYC team: (from left) Rachel Chason, Natalie Ritchie, Structured Stories creator David Caswell, and Ishan Thakore

There were a few moments during the week when I got a feeling that our project will be groundbreaking. David Caswell, the creator of the Structured Stories platform and our partner in the project, began the week with an excellent PowerPoint that explained how the students’ work will be published. Structured journalism is often known as “atomizing” the news.

A couple of times, I got the sense that we were like scientists who were about to unlock the atom of news. I summarized the first day by saying, “Mind blown.”

But there also were times where I wondered if we have too much structure in our approach and that we’ll end up creating a giant database with hundreds of humdrum entries on municipal government. We need to make sure that even though we’re using a unique approach we are still creating valuable, interesting journalism.

Our discussions during the week reflected our unique perspectives. David is a computer scientist with a really cool idea; I’m a journalist with an interest in new story forms and some old-fashioned values. There were some moments where our differences were quite clear. At one point during a discussion about story structure in the database, David told the students, “Your audience is actually a machine.”

I nearly had a heart attack. But then I realized he meant that the “story structure” the students are creating are not intended for public consumption. They’re designed to work behind the scenes so readers can get the information they want.

This is the genius of David’s approach (and also the part that scares me a little). With lots of structure inside the machine, readers will be able to get information in new ways.

We took a break from structured journalism to get a first-hand look at journalism without much structure: the Daily Show with Jon Stewart.
We took a break from structured journalism to get a first-hand look at journalism without much structure: the Daily Show with Jon Stewart.

David and I recognize this is very much an experiment. We both believe in the promise of structured journalism. But we also recognize that this is a very different way of covering the news and that sometime experiments fail. On Tuesday I tweeted:

Our approach on Structured Stories NYC is inspired by the philosopher Ms. Frizzle: “Take chances! Make mistakes! Get messy!”

Comments closed