I'm sure it'll be eons before it trickles down into the museum sector, but it's an interesting change:

Nielsen/NetRatings to use total time spent by users of a site as its primary measurement metric

In a nod to the success of emerging Web 2.0 technologies like AJAX and streaming media, one of the country's largest Internet benchmarking companies will no longer use page views as its primary metric for comparing sites.

Nielsen/NetRatings will announce Tuesday that it will immediately begin using total time spent by users of a site as its primary measurement metric.

Nielsen/NetRatings will still report page views as a secondary metric, and it will continue to reevaluate its primary metric as technology continues to evolve, Ross added. "For the foreseeable future, we will champion minutes if you are comparing two sites. Going forward, we'll see what that equates to in terms of true advertising opportunity," he said.

Meta-social networking

I've been wondering how long it would take for a meta-social networking site to emerge (or whether I should create one thereby making millions), allowing you to maintain active accounts on facebook, myspace, etc, with one single interface to read and post messages and comments, but of course Wired got there first. Sorta.

And yes, I did mean to post that many months ago! But it's still relevant because interoperability is only going to become more important in the social networking world.

This post on the Gartner "Hype Cycle for Emerging Technologies 2007" report includes the familiar Gartner Hype Cycle diagram, updated for 2007, which is more than you'll get from the Gartner site (for free, anyway).

Common Craft have produced videos on RSS in Plain English, Social Bookmarking in Plain English, Wikis in Plain English and Social Networking in Plain English (via Groundswell)

Also worth a look, Google Code for Educators "provides teaching materials created especially for CS educators looking to enhance their courses with some of the most current computing technologies and paradigms". They say, "[w]e know that between teaching, doing research and advising students, CS educators have little time to stay on top of the most recent trends. This website is meant to help you do just that" and it looks like it might also be useful for busy professionals who want to try new technologies they don't get time to play with in their day jobs (via A Consuming Experience).

Also from A Consuming Experience, a report on a talk on "5 secrets of successful Web 2.0 businesses" at the June London Geek Dinner.

On a random note, I noticed that the BBC have added social bookmarking to their news site:

I wonder if this marks the 'mainstreaming' of social bookmarking.

European search engine

EU OKs German Online Search-Engine Grant

The European Union on Thursday authorized Germany to give $165 million for research on Internet search-engine technologies that could someday challenge U.S. search giant Google Inc.

The Theseus research project — the German arm of what the French call Quaero — is aiming to develop the world's most advanced multimedia search engine for the next-generation Internet. It would translate, identify and index images, audio and text.

Fragmented European research efforts are one of the reasons blamed for the region lagging behind the United States in information technology. European companies in general spend far less on research than those based in other parts of the world, and the EU said the project should help change that.

I wonder how they'll identify and weight or rank European content. And will it be tied in with the European Digital Library?

I'm still catching up on news and various RSS feeds, here are just a few things that caught my eye.

These slides from a presentation on Open Source applications in archaeology are worth a look. They've included lots of screenshots, which is useful because it demonstrates that open source applications are becoming much more user-friendly.

Wired makes a compelling case for Twitter as a 'Social Sixth Sense':

Twitter and other constant-contact media create social proprioception. They give a group of people a sense of itself, making possible weird, fascinating feats of coordination.

In theory I just don't get Twitter but in practice I do read some long-running threads on various forums where people can post a quick rant about work, about their love life, or just add a random disclosure. If I know the people posting then I find those threads interesting. And I also love Facebook status updates for the same reason – they don't require a response but sometimes it's nice when they trigger one.

Final diary entry from Catalhoyuk

I'm back in London now but here goes anyway:

August 1
My final entry of the season as I'm on the overnight train from Cumra to Istanbul tonight. After various conversations on the veranda I've been thinking about the intellectual accessibility of our Catalhoyuk data and how that relates to web publication and this entry is just a good way to stop these thoughts running round my head like a rogue tune.

[This has turned into a long entry, and I don't say anything trivial about the weather or other random things so you'd have to be pretty bored to read it all. Shouldn't you be working instead?]

Getting database records up on the website isn't hard – it's just a matter of resources. The tricky part is providing an interesting and engaging experience for the general visitor, or a reliable, useable and useful data set for the specialist visitor.

At the moment it feels like a lot of good content is hidden within the database section of the website. When you get down to browsing lists of features, there's often enough information in the first few lines to catch your interest. But when you get to lists of units, even the pages with some of the unit description presented alongside the list, you start to encounter the '800 lamps' problem.

[A digression/explanation – I'm working on a website at the Museum of London with a searchable/browsable catalogue of objects from Roman Londinium. One section has 800 Roman oil lamps – how on earth can you present that to the user so they can usefully distinguish between one lamp and another?]

Of course, it does depend on the kind of user and what they want to achieve on the Londinium site – for some, it's enough to read one nicely written piece on the use of lamps and maybe a bit about what different types meant, all illustrated with a few choice objects; specialist users may want to search for lamps with very particular characteristics. Here, our '800 lamps' are 11846 (and counting) units of archaeology. The average user isn't going to read every unit sheet, but how can they even choose one to start with? And how many will know how to interpret and create meaning from what they read about the varying shades of brown stuff? Being able to download unit sheets that match a particular pattern – part of a certain building, ones that contain certain types of finds, units related to different kinds of features – is probably of real benefit to specialist visitors, but are we really giving those specialist visitors (professional or amateur) and our general visitors what they need? I'm not sure a raw huge list of units or flotation numbers is of much help to anyone – how do people distinguish between one thumbnail of a lamp or one unit number and another in a useful and meaningful way? I hope this doesn't sound like a criticism of the website – it's just the nature of the material being presented.

The variability of the data is another problem – it's not just about data cleaning (though the 'view features by type' page shows why data cleaning is useful) – but about the difference between the beautiful page for Building 49 and rather less interesting page for Building 33 (to pick one at random). If a user lands on one of the pages with minimal information they may never realise that some pages have detailed records with fantastic plans and photos.

So there are the barriers to entry that we might accidentally perpetuate by 'hiding' the content behind lists of numbers; and there is the general intellectual accessibility of the information to the general user. Given limited resources, where should our energies be concentrated? Who are our websites for?

It's also about matching the data and website functionality to the user and their goals – the excavation database might not be of interest to the general user in its most raw form, and that's ok because it will be of great interest to others. At a guess, the general public might be more interested in finds, and if that's the case we should find ways to present information about the objects with appropriate interpretation and contextualisation, not only to present information about the objects but also to help people have a more meaningful experience on the site.

I wonder if 'team favourite' finds or buildings/spaces/features could be a good way into the data, a solution that doesn't mean making some kinds of finds or some buildings into 'treasure' and more important than others. Or perhaps specialists could talk about a unit or feature they find interesting – along the way they could explain how their specialism contributes to the archaeological record (written as if to an intelligent thirteen year old). For example, Flip could talk about phytoliths, or Stringy could talk about obsidian, and what their finds can tell us.

Proper user evaluation would be fabulous, but in the absence of resources, I really should look at the stats and see how the site is used. I wonder if I could do a surveymonkey thing to get general information from different types of users? I wonder what levels of knowledge our visitors have about the Neolithic, about Anatolian history, etc. What brings them to the website? And what makes them stick around?

Intellectual accessibility doesn't only apply to the general public – it also applies to the accessibility of other team's or labs content. There are so many tables hidden behind the excavation and specialist database interfaces – some are archived, some had a very particular usage, some are still actively used but still have the names of long-gone databases. It's all very well encouraging people to use the database to query across specialisms, but how will they know where to look for the data they need? [And if we make documentation, will anyone read it?]

It was quite cool this morning but now it's hot again. Ha, I lied about not saying anything trivial about the weather! Now go do some work.
(Started July 29, but finally posted August 1)

The BBC says "Photo tool could fix bad images" but I think it's far more likely to be used to create fake images. I guess I wouldn't have thought of an image that shows what was actually present as 'bad' – maybe it's not the best postcard if you want the recipient to think you were in an undeveloped paradise, but it's an accurate depiction of the scene.

I'm reminded of the way the Soviets would remove people from historic photos when they fell out of favour – now the ability to rewrite history is available for you at home!

Catalhoyuk diaries: In the absence of real world content…

…more diary entries from Catalhoyuk.
July 24, started late at night in tent. [but posted as July 25, 2007]
Spent some time whizzing things around in ArcGIS the past few days. I never get to play with GIS at work so it's quite fun. I need to talk to Cord and Dave about what views/tables they need in the database to link in the excavation and finds data. It's a shame we didn't get to experiment with bringing data across before Cord left but hopefully Dave and I can do a 'proof of concept' that she can play with when she gets back to site. Maybe skellies or X-finds, or just basic unit information as a first go.

Today was going to be a solid day of programming, but the power was out for three hours last night and two hours of lab time this morning, so I'm still catching up on the stuff I was going to finish last night.

Last night I wrote myself a note from the geek perspective about "I think the challenge of Catal is combining the reflexive, the uncertain, the indeterminate, with the rigorous requirements of structured recording in a database; and perhaps more importantly, convincing people that it's possible to design to allow for uncertainty and for multivocality" but in the light of day that sounds like pretentious tosh that could only have been thought up in the middle of the night. Well done me.

July 30, middle of the day.
It's really been quite hot, though just now a change is coming through and it's getting cooler. I'm such a (lab, not southern) jessie, I can't imagine what it's like working up on the mound – apparently it's been 48C in the south shelter. The feel of the coming storm reminds me of Melbourne but I bet the heat won't break after the storm like it would at home. I hope my borrowed tent doesn't blow away.

Very frustrated by the power cuts. I feel like I'm losing lots of work time to them. My list of things to do is getting longer and my time is getting shorter. I wish it was the other way around.

I've been documenting some of the database tables in preparation for an informal tutorial on database querying tomorrow. I say 'informal' but that's really just cos I don't have time to prepare so I'll wing it. Hopefully people will have some good examples. I think it'll also point out where we need to make improvements – putting all the relationships into the central database is probably the first thing to do, so that they automatically come through into the AllTables database. This will make it a lot easier for people to join tables as some joins will be created auto-magically for them. It's probably not worth documenting all the tables at the field level, but there are some (where the data type is different between old databases, for example) where it would be useful. The descriptions could also serve as synonyms to help people find the tables they need for their queries.

Update from Catalhoyuk

In the absence of real updates, I thought I'd post some of my site diary entries. The power has been very dodgy the past few days so I might do a big catch up on email and whatever in Konya on Friday (our day off). Interestingly, Blogger has decided to present me the site in Turkish, presumably based on IP location, because the language settings on the browser are English-only. So if things go a bit strange it's because I can't really see what I'm doing.

19/07/2007
My first day on site this season. I feel like I've been pole-axed with tiredness after the trip out here, so I'm concentrating on catching up on Sarah's documentation [for her work on site this year] and generally remembering how everything works.

24/07/2007
Just had a random thought, though it's a shame I didn't think of it at the start so we could tell everyone who's been on site over the season – any blog posts, photos or videos, blah blah blah, could use the same tag (like 'catal07') on public content, so it's easier to find everything from this season regardless of where it's held.

[And now that I'm posting this on a blog I suppose I should do that myself]

Tuesday, July 24, 2007

I keep bouncing between looking at the Figurines and Ceramics databases.
I've been nabbing poor Chris whenever he comes anywhere near the computer room and asking questions about heat treatment and cores; he's been very patient.

We had a long meeting in the cafe on Sunday, and I re-jigged the recording structures afterwards. I think possibly last year's structure was too ambitious, given the time constraints on everyone – not only for building it, but for mapping data from old structures to the new and mostly for the time it takes to simply record the objects as it went into lots of technical detail that probably isn't sustainable at the moment.

With that in mind, I tipped the recording model on its head so that it's much more about observation than interpretation at this stage, particularly for colours and the various things that variations in colour indicate. For example, rather than breaking heat exposure down into manufacture, use, other events or post-deposition, for the moment it's enough to record that it's present. I've designed the forms to allow people to record the probable type of heat exposure (and how certain they are about it) if there's evidence to support it, but if there's no evidence either way they don't have to record a probable reason. The structures can be extended as we find out more about the raw materials around the site – I think they might change views on the intentionality represented by the presence of various inclusions.

I've spent the day reviewing the ceramics database structures with a view to normalising them, and also to fitting them into the shared clay recording system. It's a continuation of work from previous seasons but with the added pressure(?) challenge(?) that other teams will be using versions of the ceramics databases soon too, so it's really important to get the data structures right. Nurcan has been really helpful and her explanations of some of the changes have helped me think about the best solutions to her recording issues.

Journalists were out yesterday, we had our photo taken under the Catalhoyuk sign near the gate. Apparently it'll be in Wednesday's papers. I wonder if people in London could pick up copies in the off-licences around Green Lanes. 'Famous in Turkey' – sounds like a band name.

It's funny how the diary entries are starting to read like blog entries, and in a way they seem to be functioning a bit like a blog too, with people commenting on each other's entries. I almost feel like I should add a field so people can record which diary entry they're writing about alongside the units, etc, but would that be far too self-referential?

When the database goes back to London and is put on the web I think we should put an 'AddThis' button on the various diary, finds and excavation pages so people can add pages to social bookmarking sites, blogs, etc. If we sign up for an account we can see how it's used – I wonder how much activity that kind of 'passive' use would see compared to 'active' use like commenting on finds or excavation data. I really need to find out more about the barriers to participation for actively creating content. I'll suggest the 'AddThis' thing to Ian and Shahina if I get a chance.

I'm starting to really wish I'd had my hair cut before I left London because it's taken on a life of its own. Not that it really matters out here, I guess.

I'm off to Catalhoyuk for two weeks

I'm off to Çatalhöyük, Turkey, for two weeks, for the usual database analysis/design/development. I don't imagine I'll have any time to post updates, so you can make do with photos from previous years in the meantime.

www.flickr.com

Catalhoyuk 2004, 2005, 2006 photoset