Meet other developers and web creators to discuss creating websites to web standards.

Linked Data

Posted: September 27th, 2009 | Author: Nick Smith | Filed under: linked data, semantic web | 3 Comments »

On Monday 7 September London Web Standards was pleased to host the BBC’s Michael Smethurst, Information Architect and Yves Raimond, Software Engineer. See their presentation slides and the following is a recap of the key points.

Some background

Some think that the web is simply the internet plus links. In fact, it’s the internet plus HTTP (Hypertext Transfer Protocol). It’s an important distinction and something that gets forgotten. For example, discussions on accessibility often focus on documents. HTTP and URIs (Uniform Resource Identifiers) don’t get mentioned, they are the other side of the web that gets ignored. Where HTML has been through many revisions, HTTP is still at version 1.1, this standard has stood the test of time.

The web is the Internet, HTTP and HTML (to a lesser extent)

Everything that’s good about the web is linking. People need and want to share content, therefore they need handles, or links, to use. In Search Engine Optimisation (SEO), Google PageRank holds HTTP and the URI as more important than the HTML of the page. By definition, a page of HTML is not a web page until it’s linked to something else.

Crucially, we need to get from a web of documents to a web of things. Linked Data allows for this further level of abstraction. Where humans can make meaning out of the content of a web page, the web needs to progress to a stage where machines can understand it.

Now for the talk

Some types of Linked Data:

  • RDF. This was described more than ten years ago. It’s very easy as it’s just data. It works in an Object > Predicate > Object fashion for example: <sky> <has colour> <blue>. This information forms a graph.
  • FOAF (Friend of a Friend). When it was first described, people focused on this because it was the only spec that had traction.
  • REST (Representational State Transfer). This explains proper use of HTTP. Separation of resources and representations of it. For example, browsers such as Safari and Firefox have a list of preferences such as your preferred language or context (e.g. mobile). When you request a document from a URI you are getting a representation of the resource that matches your headers. The browser and server work together to understand what version of the document you want. Your browser searches for the correct MIME types that suit these preferences. The copy of the document you see is only a representation that best matches your requirements. So, sometimes if the server can’t find a version that fits, it could return a “406 not acceptable” error. This concept is important when combating the ghettoisation of mobile content. The device you use and your language are all important, the same URI could serve to any context. This is important if you don’t want to split your “Google Juice”, as Michael puts it. AKA, your Google PageRank.

Linked Data = Links + HTTP + RDF

Between the development of FOAF and about 2006, there have been lots of new ideas for Linked Data, but all have been too complex. By extension, people thought the semantic web was too complex. Linked Data still uses RDF, but the difference is that now it’s about things not documents. Example, the Eastenders website may have a file that you’ll be able to ask for in several flavours. Linked Data is about giving information that you can’t get “down the wires”.

Using Linked Data you can describe the relationship between different aspects of the same resource. E.g. you can describe the relationship between a website and its creator. Moustaki.org was created by Yves, but not the other way around. You can use Linked Data to make claims about documents and about things.

What’s the difference between Linked Data and Microformats?

Linked Data provides another level of abstraction. Instead of relating two documents (e.g. with a REL statement), you relate two things. Also, with Linked Data there are lots of ways to publish these relationships:

  • Hashes. Resources are defined relatively to a document. If you’re concerned about the number of GET requests, this is a good option because you don’t need a 303 error page.
  • Slashes. A very popular option, but also the hardest to convey information. Michael and Yves show an example of content negotiation where often with this system two GET requests are needed. On the plus side, the server doesn’t care about anything after the hash.
  • RDFa. The cheapest to set up.

Find out more about hashes versus slashes.

What’s the point of Linked Data?

Seperate individuals may host different content about the same thing. The English band The Fall, was used as an example of the power of this abstraction. Linked Data can be used to aggregate information specifically about this band. Linked Data is a web scale database. The OWL:sameAs statement is a prime example of this kind of functionality.

But how do we handle conflicts? As with any web browsing, you must keep track of where you get any data from. Also anyone can make a claim on a URI. Your weighting could be based on PageRank. Spam is also a problem and OWL:sameAs can go wrong with the language eg, there are two bands called ‘U2′ (one Japanese, one Irish). There are sixteen bands called ‘Arora’ etc.

What are the pitfalls, how is it being used and developed?

FRBR (Functional Requirements for Bibliographic Records) is still being developed, lots of work is being done on its different classes. But music ontology is still an area that needs development. There is an need to make a distinction between composer, performances, signal etc.

A recent example of the problems of this web database came when researchers at the BFI discovered the history of a famous actor who had acted in a porn film. They added this to his record. Despite it being true, the actor didn’t want this to be known and the BFI were deemed to be in the wrong.

Search engines are already using Linked Data. For example, Google is publishing content from Bestbuy. Also, social media is already using Linked Data in their activity feeds. RDF is mostly populated by social data. US porn websites are obliged to use RDF. To find out more about the whereabouts of RDF, go to: http://pingthesemanticweb.com/


Important changes to London Web Standards

Posted: September 2nd, 2009 | Author: bendodson | Filed under: Announcements | Tags: | No Comments »

Dear London Web Standardistas,

We have a number of important updates to share with the community today.

Our beloved Jeff Van Campen is moving on to become a father for the first time and has handed over the important activity of organising London Web Standards to a new group of enthusiasts. We would like to wish Jeff the best of luck in future and look forward to seeing him sneak out of from the clutches to parenthood every now and then to discuss the important matters of properly formed HTML5, rounded corners with CSS3, linked data, RDFa and so on!

London Web Standards is moving from strength to strength. We have now set up our own dedicated web site at http://www.londonwebstandards.org and have our own professional logo. This megalithic structure will require looking after by a team of well rounded individuals who know a thing or two about these matters. At our handover meeting, we decided the following team structure:

Ben Dodson (@bendodson) will be the main organiser of events and the venue
Nick Smith (@twitrnick) and Jason Grant (@flexewebs) shall be documenting decisions and maintain the content on the web site
Rob Enslin (@robenslin) will be taking care of the social network engagement aspects of LWS
Joe Lanman (@joelanman) will be in charge of back end technical infrastructure for LWS web site

Our last two meetings have been held at ‘The Square Pig’ near Holborn and we will be continuing to use this venue for the foreseeable future. It’s a great central venue with projection facilities and good beer…

Now our website is up and running, we would like to grow the group further in a very open manner and have therefore decided to move over to Upcoming for event organisation as well as Yahoo Groups for various discussions. Our Twitter channel (@webstandards) is still open as usual. We’ll be running a website design competition shortly (more details at the meetup) and hopefully all of the above changes will become a lot clearer then!

Please make sure to signup to the new Yahoo Group as we will be sending all future emails from there. We will no longer be using meetup.com after this months meeting.

We will go over all of these changes at the next meeting which is on 7th September at 7pm at The Square Pig in Holborn and will feature Michael Smethurst (BBC Information Architect) and Yves Raimond (BBC Software Engineer) who will be leading a discussion on Linked Data. They will be covering the basics of Linked Data, and will then be taking any questions or comments. We will also be running some quick-fire 5 minute warm up speeches at the beginning of the meeting covering a wide ranging number of topics. In our October meeting, Phil Archer will be giving a presentation on Mobile Web Best Practices and we have a presentation around web based typography planned for November. If you know of anybody who may want to present in the future, then please email Ben via ben.dodson@londonwebstandards.org

It’s going to be a busy time for us whilst we make the changes listed above but if you have any suggestions on how we can do things better then please let us know by emailing organisers@londonwebstandards.org – we look forward to seeing you on 7th September.

London Web Standards Team