Mashed museum and UK MW 2008 write-up

A report I wrote on 'The 2008 Mashed Museum Day and UK Museums on the Web Conference' is now live on the Ariadne site. I've already reported on most of the sessions and the mashed museum day here, but the opportunity to reflect on the day and write for a different audience was useful. The review really made me appreciate that time and space away from all the noise of everyday life in which to learn, try and think is incredibly important, whether you call it a workshop or an away day or something else entirely:

One lesson from the Mashed Museum day was that in a sector where innovation is often hampered by a lack of financial resources, time is a valuable commodity. A day away from the normal concerns of the office in 'an environment free from political or monetary constraints' is valuable and achievable without the framework of an organised event. An experimental day could also be run with ICT and curatorial or audience-facing staff experimenting with collections data together.

The Ariadne issue is packed full of articles I've marked 'to read', so you might also find them interesting.

Quick and light solutions at 'UK Museums on the Web Conference 2008'

These are my notes from session 4, 'Quick and light solutions', of the UK Museums on the Web Conference 2008. In the interests of getting my notes up quickly I'm putting them up pretty much 'as is', so they're still rough around the edges. There are quite a few sections below which need to be updated when the presentations or photos of slides go online. [These notes would have been up a lot sooner if my laptop hadn't finally given up the ghost over the weekend.]

Frankie Roberto, 'The guerrilla approach to aggregating online collections'
He doesn't have slides, he's presenting using Firefox 3. [You can also read Frankie's post about his presentation on his blog.]

His projects came out of last year's mashed museum day, where the lack of re-usable cultural heritage data online was a real issue. Talk in the pub turned to 'the dark side' of obtaining data – screen scraping was one idea. Then the idea of FoI requests came up, and Frankie ended up sending Freedom of Information requests to national museums, asking for their data in any electronic format with some kind of structure.

He's not showing the site he presented at Montreal; it should be online soon and he'll release the code.

Frankie demonstrated the Science Museum object wiki.

[I found 'how it works' as the focus of the object text on the Science Museum wiki a really interesting way of writing object descriptions; it could work well for other projects.]

He has concerns about big top down projects so he's suggesting five small or niche projects. He asked himself, how do people relate to objects?
1. Lots of people say, "I've got one of these" so: ivegotoneofthose.com – put objects up, people can hit button to say 'I have one of those'. The raw numbers could be interesting.
[I suggested this for Exploring 20th Century London at one point, but with a bit more user-generated content so that people could upload photos of their object at home or stories about how they got it, etc. I suppose ivegotoneofthose.com could be built so that it also lets people add content about their particular thing, then ideally that could be pulled back into and displayed on a museum site like Exploring. Would ivegotoneofthose.com sit on top of a federated collections search or would it have its own object list?]
2. Looking at TheyWorkForYou.com, he suggests: TheyCollectForYou.com – scan acquisition forms, publish feeds of which curators have bought what objects. [Bringing transparency to the acquisition process?]
3. Looking at howstuffworks.com, what about howstuffworked.com?
4. 'what should we collect next?' – opening up discourse on purchasing. Frankie took the quote from Indiana Jones: thatbelongsinamuseum.com – people can nominate things that should be in a museum.
5. pricelessartefact.com – [crowdsourcing object evaluation?] – comparing objects to see which is the most valuable, however 'valuable' is defined.
[Except that possibly opens the museum to further risk of having stuff nicked to order]

Fiona Romeo, 'Different ways of seeing online collections'
I didn't take many detailed notes for this paper, but you can see my notes on a previous presentation at Notes from 'Maritime Memorials, visualised' at MCG's Spring Conference.

Mapping – objects don't make a lot of sense by themselves, but are compelling as part of information about an expedition, or failed expedition.

They'll have new map and timeline content launching next month.

Stamen can share information about how they did their geocoding and stuff.

Giving your data out for creative re-use can be as easy as giving out a CSV file.
You always want to have an API or feed when doing any website.
The National Maritime Museum take any data set they can find without licensing restrictions and put it online for creative re-use.
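To show how little is involved in the 'just give out a CSV file' approach, here is a minimal sketch using only Python's standard library. The records and field names are made up for illustration; they aren't taken from the National Maritime Museum or any real collections system.

```python
import csv
import io

# Hypothetical records; the field names are illustrative only.
RECORDS = [
    {"id": "OBJ0001", "title": "Sextant", "date": "1790", "place": "London"},
    {"id": "OBJ0002", "title": 'Ship\'s log, with "notes"', "date": "1845", "place": "Greenwich"},
]


def records_to_csv(records, fieldnames):
    """Return the records as CSV text with a header row.

    Fields not listed in fieldnames are ignored; the csv module handles
    quoting of commas and quotation marks inside values.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for record in records:
        writer.writerow(record)
    return buffer.getvalue()


CSV_TEXT = records_to_csv(RECORDS, ["id", "title", "date", "place"])
print(CSV_TEXT)
```

The resulting text can be saved as a file and linked from a web page, which is about as light as a 'data release' gets.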

[Slide on approaches to data enhancement.]
Curation is the best approach but it's time-consuming.

Fiona spoke about her experiments at the mashed museum day – she cut and pasted transcript data into IBM's Many Eyes. It shows that really good tools are available, even if you don't have the resources to work with a company like Stamen.

Mike Ellis presented a summary of the 'mashed museum' day held the day before.

Questions, wrap up session
Jon – always assume there (should be) an API

[A question I didn't ask but posted on twitter: who do we need to get in the room to make sure all these ideas for new approaches to data, to aggregation and federation, new types of experiences of cultural heritage data, etc, actually go somewhere?]

Paul on fears about putting content online: 'since the state of Florida put pictures of their beaches on their website, no-one goes to the beach anymore'.

Metrics:
Mike: need to go shout at DCMS about the metrics, need to use more meaningful metrics especially as thinking of something like APIs
Jon: watermark metadata… micro-marketing data.
Fiona: send it out with a wrapper. Make it embeddable.

Question from someone from Guernsey Museum about images online: once you've downloaded your nice image it's without metadata. George: Flickr likes as much data in EXIF as possible. EXIF data isn't permanent but is useful.

Angela Murphy: wrappers are important for curators, as they're more willing to let things go if people can get back to the original source.

Me, referring back to the first session of the day: what were Lee Iverson's issues with the keynote speech? Lee: partly about the role of an institution like the BBC in the modern space. A national broadcaster should set social common ground, be a fundamental part of democratic discussion. It's even more important now because of the variety of sources out there, with people shutting off or being selective about information sources to cope with information overload. Disparate sources mean no middle ground or possibility of discussion. BBC should 'let it go' – send the data out. The metric becomes how widely does it spread, where does it show up? If restricted to non-commercial use then [strangling use/innovation].

The 'net recommender' thing is a flawed metric – you don't recommend something you disagree with, something that is new or difficult knowledge. What gets recommended is a video of a cute 8 year old playing Guitar Hero really well. People avoid things that challenge them.

Fiona – the advantage of the 'net recommender' is that it takes the judgement of quality outside the originating institution.

Paul wondered why 7 – 8 on a scale of 10 is neutral for British people; he would have thought it's 5 – 6.

Angela: we should push data to DCMS instead of expecting them to know what they could ask for.

George: it's an opportunity to change the way success is measured. Anita Roddick says 'when the community gives you wealth, it's time to give it back'. [Show, don't tell] – what would happen if you were to send a video of people engaging instead of just sending a spreadsheet?

Final round comments
Fiona: personal measure of success – creating culture of innovation, engagement, creating vibrant environment.

Paul: success is getting other people to agree with what we've been talking about [at the mashed museum day and conference] the past two days. [yes yes yes!] A measure of success was how a CEO reacted to discovering videos about their institution on YouTube – he didn't try to shut it down, but asked, 'how can we engage with that?'

Ross on 'take home' ideas for the conference
Collections – we conflate many definitions in our discussions – images, records, web pages about collections.

Our tone has changed. Delivery changed – realignment of axis of powers, MLA's Digital portfolio is disappearing, there's a vacuum. Who will fill it? The Collections Trust, National Museum Directors' Conference? Technology's not a problem, it's the cultural, human factors. We need to talk about where the tensions are, we've been papering over the cracks. Institutional relationships.

The language has changed – it was about digitisation, accessibility, funding. Three words today – beauty, poetry, life. We're entering an exciting moment.

What's the role of the Museums Computer Group – how and what can the MCG do?

Next-generation approaches at 'UK Museums on the Web Conference 2008'

Session 3, 'Next-generation approaches', of the UK Museums on the Web Conference 2008 was introduced by Jon Pratty.

Jon asked, 'what is a virtual museum?' It can be pretty much anything. Lots of valuable historical documents aren't in an 'online museum', they're just out there to be found by search. It raises the question – how much permanence should digital objects have?

George Oates, 'Sharing museum collections through Flickr'
Introducing the Flickr Commons project and talking about some early results. Some practical information on what it means to join the program, and things that have come out of it.

Flickr 'swerved in from left field' and bumped into museum people and librarians and archivists.

It started with Library of Congress thinking about how to engage with Web 2.0. They were looking for a Web 2.0 partner. They have 14 million images, about a million digitised.

Flickr is designed specifically to search and browse photos. It has a big infrastructure and supports interfaces in 8 languages. It has lots of eyeballs – "it's made of people".

From the Commons point of view, it's simply a service, organisations can publish content into it.

They hit a hurdle: can a collecting institution publish content onto a site like Flickr? As a collecting institution, someone like the Library of Congress doesn't necessarily own the copyright or know who the copyright holder was. They devised a new statement, 'no known copyright restrictions'. This gave institutions a way to publish content once they had done as much work as they could to trace the copyright holders, even if they weren't able to find them.

Might open up to other sorts of content.

What's it for? Increase access to public photography collections; gather context about them, [something else I missed].

Powerhouse – lots of the collection was geo-tagged. It means you can find photos from then and now, for example around the CBD of Sydney. [Cool! I love the way geo-tagging content lets you build up layers of history]

Brooklyn – it made sense to use their existing established Flickr account, so Flickr created functionality to support that. The Smithsonian joined on Monday.

Soon they'll have content from other partners including a charming collection from a tiny local museum.

Results:
Last 28 days Library of Congress – 15,000 [or 50,000?] views per day, 8 million views over last six months, 72,000 tags.
Powerhouse – 77,000 views (more views of that collection in one month than in the whole previous year), 3500 tags.
Brooklyn – figures affected by merged account issue.
Smithsonian – 10,000 views in first day, 100 new contacts

The numbers are probably affected by the ratio of photos, e.g. smaller numbers when an institution has put fewer photos online.

"But, is it any good?"
Suddenly there are conversations between Flickr users and institutions, and between Flickr users, contributing information and identifications.

They contribute the identification of places and people, with information about the history behind photos.

Now and then – people are adding their recent photos of a location via comments on Flickr.

Library of Congress have made a list of types of interactions [slides], they include the transcription of text on signs, posters, etc in background, geo-tags, non-English tags.

Institutional context and Flickr – bind them together with hyperlink, but being on Flickr frees a program from institutional constraints.

Flickr has been designed as a vessel or platform where interactions and conversations can happen.

The information that the community provides is proving useful. The Library of Congress has updated 176 records in its catalogue, recording that the change is based on 'information provided by Flickr Commons Project 2008'.

The Smithsonian found it was an opportunity for collaboration between institutions/departments and staff.

How to join: the process is publish – interact – feedback.

What to think about: give a broad representation of what's in your collection. Think about placement of images in photostream and sets. Plan to attract special interest groups. Think about what is already digital, what is popular? Feedback from a live community can direct your digitisation efforts. Or you could go into your stores or collections database and possibly digitise at random.

How much metadata to include? How many fields from database into description of photo; more or less?

When: can be a challenge for institutions.

How? You could use the normal Flickr Uploadr if you don't have too many images; or you could use the API to write applications that will work with Collections Management Systems.

Who? Might be web technician and curator.

The catch? It costs $24.95 for a Pro account. But you get unlimited storage, and could conceivably put whole collection online.

The future:
It's a work in progress. Probably will end up developing tools like additional reporting
Grow gently (make sure institution can handle the changes and respond to interactions)
They will continue their focus on photographs, not photographs of objects "(sorry)". "Flickr is about … empathic photography"
"Go local" e.g. small archives in little towns – people can still participate even if they don't have a web team, or web site.
API methods, RSS
Searching, browsing, maps
Search across Commons coming soon. Maybe combine searches to see a map of photos taken in 1910.

'Sector-wide initiatives' at 'UK Museums on the Web Conference 2008'

Session 2, 'Sector-wide initiatives', of the UK Museums on the Web Conference 2008 was chaired by Bridget McKenzie.

In the interests of getting my notes up quickly I'm putting them up pretty much 'as is', so they're still rough around the edges. There are quite a few sections below which need to be updated when the presentations or photos of slides go online. Updated posts should show in your RSS feed but you might need to check your settings.

[I hope Bridget puts some notes from her paper on her blog because I didn't get all of it down.]

The session was introduced as case studies on how cross-institutional projects can be organised and delivered. She mentioned resistance to bottom-up or experimental approaches; institutional constraints; and building on emerging frames of the web.

Does the frame of 'the museum' make sense anymore, particularly on the web? What are our responsibilities when we collaborate? Contextual spaces – chance to share expertise in meaningful ways.

It's easy to revert to ways previous projects have been delivered. Funding plans don't allow for iterative, new and emergent technologies.

Carolyn Royston and Richard Morgan, V&A and NMOLP.
The project is funded by the 'invest to save' program, Treasury.

Aims:
Increase use of the digital collections of the 9 museums (no new website)
No new digitisation or curatorial content.
Encourage creative and critical use of online resources.
[missed one]
Sustainable high-quality online resource for partners.

The reality – it's like herding cats.

They had to address the issue of partnership to avoid problems later in the project.

Focussed on developing common vision, set of principles on working together, identify things uniquely achievable through partnership, barriers to success, what added value for users.

Three levels of barriers to success – one of working in an inter-museum collaborative way, which was a first for those nationals; organisational issues – working inter-departmentally (people are learning people or web people or whatever, and aren't used to working together); personal issues – people involved who may not think they are web or learning people.

These things aren't necessarily built in to the project plan.

Deliverables: web quests, 'creative journeys', federated search, [something I missed], new ways of engaging with audiences.

Web Quests – online learning challenge, flexible learning tool mapped to curriculum. They developed a framework. It supports user research, analysis and synthesis of information. Users learn to use collections in research.

Challenges: creating meaningful collection links; sending people to collections sites knowing that content they'd find there wasn't written for those audiences; provide support for pupils when searching collections. Sustainable content authoring tool and process.

[I wondered if the Web Quest development tools are extendible, and had a chance to ask Carolyn in one of the breaks – she was able to confirm that they were.]

Framework stays on top to support and structure.

Creative journeys:
[see slide]

They're using Drupal. [Cool!]

[I also wondered about the user testing for creative journeys, whether there was evidence that people will do it there and not on their blogs, Zotero, in Word documents or hard drives – Carolyn also had some information on this.]

Museums can push relevant content.

What are the challenges?
How to build and sustain the Creative Journeys (user-generated content) communities, individually and as a partnership?
Challenge to curatorial authority and reputation
Work with messiness and complexity around new ways of communicating and using collections
Copyright and moderation issues

But partners are still having a go – shared risk, shared success.

Federated search
Wasn't part of original implementation plan
[slide on reasons for developing]
The project uses a cross-collection search; it isn't a cross-collection search project. The distinction can be important.

The technical solution was driven by project objectives [choices were made in that context, not in a constraint-free environment.]

Richard, Technical Solution
The back-end is de-coupled from front end applications
A feed syndicates user actions.

Federated search – a system for creating machine readable search results and syndicating them out.
Real time search or harvester. [IMO, 'real time' should always be in scare quotes for federated searches – sometimes Google creates expectations of instantaneous results that other searches can't deliver, though the difference may only be a matter of seconds.]
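To make the 'real time' flavour of federated search concrete, here is a rough sketch of the general idea, not the NMOLP implementation (I don't know how theirs is built). It queries every source in parallel, merges whatever comes back within a time limit, and returns machine-readable JSON. The sources here are hypothetical in-memory lists; in practice each would be a call to a partner's search service.

```python
import json
from concurrent.futures import ThreadPoolExecutor, wait

# Hypothetical in-memory "partner collections"; a real source would be an HTTP call.
SOURCES = {
    "museum_a": [
        {"id": "a1", "title": "Steam engine model"},
        {"id": "a2", "title": "Tea set"},
    ],
    "museum_b": [{"id": "b1", "title": "Engine drawings"}],
}


def search_source(name, term):
    """Search one partner's records by case-insensitive title match."""
    term = term.lower()
    return [dict(r, source=name) for r in SOURCES[name] if term in r["title"].lower()]


def federated_search(term, timeout_seconds=2.0):
    """Query all sources in parallel and merge results that arrive in time."""
    pool = ThreadPoolExecutor(max_workers=len(SOURCES))
    futures = {name: pool.submit(search_source, name, term) for name in SOURCES}
    done, _ = wait(futures.values(), timeout=timeout_seconds)
    pool.shutdown(wait=False)  # return without waiting for slow sources to finish

    results, failed = [], []
    for name, future in futures.items():
        if future in done and future.exception() is None:
            results.extend(future.result())
        else:
            failed.append(name)  # timed out or raised an error
    return json.dumps(
        {"query": term, "results": results, "failed_sources": failed}, sort_keys=True
    )


print(federated_search("engine"))
```

A harvester takes the other route: it copies records from each source ahead of time and searches a local index, trading freshness for speed and for independence from slow partners.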

Data manipulation isn't the difficult bit

Creative Journeys – more machine readable data

Syndicated user interactions with collections.
Drupal [slide]

Human factor – how to sell to board
Deploy lightweight solutions. RAD. Develop in house, don't need to go to agency.

[I'd love it if the NMOLP had a blog, or a holding page, or something, where they could share the lessons they've learnt, the research they've done and generally engage with the digital museum community. Generally a lot of these big infrastructure projects would benefit from greater transparency, as scary as this is for traditional organisations like museums. The open source model shows that many eyeballs mean robust applications.]

Jeremy Ottevanger and Europeana/the European Digital Library
[I have to confess I was getting very hungry by this point so you might get more detailed information from Jeremy's blog when he adds his notes.]
Some background on his involvement in it, hopes and concerns.
"cross-domain access to Europe's cultural heritage"
Our content is more valuable together than scattered around.

Partnership, planning and prototyping
Not enough members from the UK, not very many museums.
Launch November this year
Won't build all of planned functionality – user-generated content and stuff planned but not for prototype.

Won't build an API or all levels of multilinguality (in first release). Interface layer may have 3 or 4 major languages; object metadata (maybe a bit) and original content of digitised documents.

Originals stay on the content contributors' sites, so traffic ends up there. That's not necessarily clear in the maquette (prototype). [But that knowledge might help address some concerns generally out there about off-site searches]

Search, various modes of browsing, timeline and stuff.

Jeremy wants to hear ideas, concerns, ambitions, etc to take to plenary meeting.

He'd always wanted a personal place to play with stuff.

[Similarly to my question above, I've always wondered whether users would rely on a cultural heritage sector site to collate their data? What unique benefits might a user see in this functionality – authority by association? live updates of data? Would they think about data ownership issues or the longevity of their data and the reliability of the service?]

Why are there so few UK museums involved in this? [Based on comments I've heard, it's about no clear benefits, yet another project, no API, no clear user need] Jeremy had some ideas but getting in contact and telling him is the best way to sort it out.

Some benefits include common data standards, a big pool of content that search engines would pay attention to in a way they wouldn't on our individual sites. Sophisticated search. Will be open source. Multi-lingual technology.

Good news:
"API was always in plans".

EDLocal – PNDS. EU projects will be feeding in technologies.

Bad news: API won't be in website prototype. Is EDLocal enough? Sustainability problems.
'Wouldn't need website at all if had API'. Natural history collections are poorly represented.

Is OAI a barrier too far? You should be able to upload from spreadsheet. [You can! But I guess not many people know this – I'm going to talk to the people who coded the PNDS about writing up their 'upload' tool, which is a bit like Flickr's Uploadr but for collections data.]
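As an illustration of why a spreadsheet can be enough, here is a minimal sketch that turns rows of a CSV export into simple Dublin Core records of the kind OAI-PMH repositories commonly exchange (the `oai_dc` format). This is not the PNDS upload tool, which I haven't seen the code for. The sample data is made up, and I'm assuming the column headings already match Dublin Core element names.

```python
import csv
import io
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
ET.register_namespace("dc", DC_NS)
ET.register_namespace("oai_dc", OAI_DC_NS)

# Hypothetical spreadsheet export; column names are assumed to be
# Dublin Core element names (identifier, title, creator, date).
SPREADSHEET = (
    "identifier,title,creator,date\n"
    "OBJ0001,Sextant,Unknown maker,1790\n"
    "OBJ0002,Chart of the Thames,,1845\n"
)


def rows_to_dublin_core(csv_text):
    """Return one oai_dc XML string per spreadsheet row, skipping empty cells."""
    records = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        record = ET.Element(f"{{{OAI_DC_NS}}}dc")
        for column, value in row.items():
            if value:
                ET.SubElement(record, f"{{{DC_NS}}}{column}").text = value
        records.append(ET.tostring(record, encoding="unicode"))
    return records


DC_RECORDS = rows_to_dublin_core(SPREADSHEET)
for xml_record in DC_RECORDS:
    print(xml_record)
```

A real tool would also need to map an institution's own column names onto the standard elements and validate the values, which is where most of the effort would go.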

Questions
Jim O'Donnell: regarding the issue of lack of participation. People often won't implement their own OAI repository so that requirement puts people off.

Dan Zambonini: aggregation fatigue. 'how many more of these things do we have to participate in'. His suggestion: it should be the other way around. Tell museums to build APIs so that projects can use their data. Jeremy responded that that's difficult for smaller museums. [Really good point, and the PNDS/EDL probably has the most benefits for smaller museums; bigger museums have the infrastructure not to need the functionality of the PNDS though they might benefit from cross-sector searching and better data indexing.]

Gordon McKenna commented: EDLocal starts on Wednesday next week, for three years.

George Oates: what's been most surprising in collaboration process? Carolyn: that we've managed to work together. Knowledge sharing.