<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Phil's JISC CETIS blog</title>
	<atom:link href="http://blogs.cetis.ac.uk/philb/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.cetis.ac.uk/philb</link>
	<description>I suppose I ought to put something here</description>
	<pubDate>Wed, 14 Mar 2012 15:34:28 +0000</pubDate>
	
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>CETIS publications, now on WordPress</title>
		<link>http://blogs.cetis.ac.uk/philb/2012/03/13/cetis-publications-now-on-wordpress/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2012/03/13/cetis-publications-now-on-wordpress/#comments</comments>
		<pubDate>Tue, 13 Mar 2012 14:53:06 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[CETIS-Content]]></category>

		<category><![CDATA[repositories]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=584</guid>
		<description><![CDATA[We have recently changed how we present our publications to the world. Where once we put a file on the web somewhere, anywhere, and entered the details into a home-spun publication database, now we use WordPress. We&#8217;re quite pleased with how that has worked out, so we&#8217;re sharing the information that might help others use [...]]]></description>
			<content:encoded><![CDATA[<p>We have recently changed how we present our <a href="http://publications.cetis.ac.uk/">publications</a> to the world. Where once we put a file on the web somewhere, anywhere, and entered the details into a home-spun publication database, now we use WordPress. We&#8217;re quite pleased with how that has worked out, so we&#8217;re sharing the information that might help others use WordPress as a  means of presenting publications to the world (a repository, if you like).</p>
<p><strong>Why WordPress?</strong><br />
First, what were we trying to achieve? The overall aims were to make sure that our publications had good exposure online, to have a more coherent approach to managing them (for example to collect all the files into one place in case we ever need to migrate them), and to move away from the bespoke system we were using to a system that someone else maintains. There were a few other requirements, we wanted something that was easy for us to adapt to fit the look and feel of the rest of our website, that was easy to maintain (familiarity is an important factor in how easy something is&#8211;it&#8217;s easy to use something if you know how to use it),  and we wanted something that would present our publications in HTML and RSS sliced and diced by topic, author, and publication type: a URL for each publication and for each type of publication and feeds for everything. We&#8217;re not talking about a huge number of publications, maybe 100 or so, so we didn&#8217;t want a huge amount of up-front effort.</p>
<p>We thought about Open Journal Systems, but there seemed to be a whole load of workflow stuff that was relevant to Journals but not our publications. Likewise we thought about ePrints and Dspace, but they didn&#8217;t quite look like we wanted, and we are far more familiar with WordPress. As a wildly successful open source project, WordPress also fits the requirement of being maintained by other people, not just the core programme, but all those lovely plugins and themes. So the basic plan was to represent each publication in a WordPress post and to use a suitable theme and plugins to present them as we wanted.</p>
<p><strong>The choice of theme</strong><br />
Having settled on WordPress the first decision was which theme to use. In order to get the look and feel to be similar to the rest of the CETIS website (and, to be honest, to make sure our publications pages didn&#8217;t look like a blog) we needed a very flexible theme. The most flexible theme I know of is <a href="http://wordpress.org/extend/themes/atahualpa">Atahualpa</a>, with over 200 options, including custom CSS snippets, parameters and HTML snippets it&#8217;s close to being a template for producing you own custom themes. So, for example, the theme options I have set include a byline of  <code>By %meta('By')%.  %date('F Y')%</code> which automatically inserts the additional metadata field &#8216;By&#8217; and the date in the format of my choice, all of which can be styled any way I want. I&#8217;ll come back to the &#8220;byline&#8221; metadata later.</p>
<p>One observation here: there is clearly a trade-off between this level of customisation and ease of maintenance. On the one hand these are options set within the Atahualpa theme that can be saved between theme upgrades, which is better than would have been the case had we decided to fork the theme or add a few lines of custom code to the theme&#8217;s PHP files. On the other hand, it is not always immediately obvious which setting in the several pages of Atahualpa theme options has been used to change some aspect of the site&#8217;s appearance. </p>
<p><strong>A post for each publication</strong><br />
As I mentioned above we can represent each publication by creating a WordPress post, but what information do we want to provide about each publication and how does it fit into a WordPress post? Starting with the simple stuff:</p>
<ul>
<li>Title of the publication -&gt; title of WordPress post.</li>
<li>Abstract / summary -&gt; body of post.</li>
<li>Publication file -&gt; uploaded as attached media.</li>
<li>Type of publication -&gt; category.</li>
<li>Topic of publication -&gt; tag.</li>
</ul>
<p>Slightly less simple:</p>
<ul>
<li>The date of the publication is represented as the date of the post. This is possible because WordPress lets you choose when to publish post. The default is for posts to be published immediately when you press the Publish button, however you can edit this to have them published in the past <img src='http://blogs.cetis.ac.uk/philb/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
<div id="attachment_596" class="wp-caption aligncenter" style="width: 217px"><img src="http://blogs.cetis.ac.uk/philb/files/2002/01/publishdate.png" alt="WordPress publication date option" width="207" height="193" class="size-full wp-image-596" /><p class="wp-caption-text">WordPress publication date option</p></div></li>
<li>The author of the publication becomes the author of the post, but there are some complications. It&#8217;s simple enough when the publication has a single author who works for CETIS, I just added everyone as an &#8220;author&#8221; user of WordPress and a WordPress admin user can attribute any given post to the author of the publication it represents. Where there are two or more authors a nifty little plugin called <a href="http://wordpress.org/extend/plugins/co-authors-plus/">Co-Authors Plus</a> allows them all to be assigned to the post. But we have some publications that we have commissioned from external authors, so I created an user called &#8220;Other&#8221; for these &#8220;external to CETIS&#8221; authors. This saves having a great long list of authors to maintain and present, but creates a problem of how to attribute these external authors, a problem that was solved using WordPress&#8217;s &#8220;additional metadata&#8221; feature to enter a &#8220;by-line&#8221; for all posts.  This also provides a nicely formatted by-line for multi-author papers with out worrying about how to add PHP to put in commas and &#8220;and&#8221;s.</li>
<li>The only other additional metadata added was an identifier for each publication, e.g. the latest QTI briefing paper is No. 2011:B02.</li>
</ul>
<p><strong>Presenting it all</strong><br />
As well as customisation for the look and feel, the Atahualpa theme allows for menus and widgets to added to the user interface. Atahualpa has an option to insert a menu into the page header which we used for the links to the other parts of the CETIS website. On the left hand side bar we&#8217;ve used the <a href="http://en.support.wordpress.com/widgets/custom-menu-widget/">custom menu widget</a> to list the tags and categories to provide access to the publications divided by topic and publication type as HTML and as a feed (just add /feed to the end of the URL). Also on the left, the <a href="http://wordpress.org/extend/plugins/list-authors/">List Authors</a> plugin gives us links to publications by author.</p>
<p>In order to provide a preview of the publication in the post I used the <a href="http://wordpress.org/extend/plugins/tgn-embed-everything/">TGN embed everything</a> plugin. The only problem is that the &#8220;preview&#8221; is too good: it&#8217;s readable but not the highest quality, so it might lead some people to think that we&#8217;re disseminating low quality versions of the papers, whereas we do include links to high quality downloadable files.</p>
<p>The built-in WordPress search is rubbish. For example, it doesn&#8217;t include the author field in the search (not that the first thing we tested was vanity searching), and the results returned are sorted by date not relevance. Happily the <a href="http://wordpress.org/extend/plugins/relevanssi/">relevanssi</a> plugin provides all the search we need.</p>
<p>Finally a few tweaks. We chose URL patterns that avoid unnecessary cruft, and closed comments to avoid spam. We installed the <a href="http://wordpress.org/extend/plugins/google-analytics-for-wordpress/">Google analytics plugin</a>, so we know what you&#8217;re doing on our site, and the <a href="http://wordpress.org/extend/plugins/login-lock/">login lock plugin</a> for a bit of security. The only customisation that we want that couldn&#8217;t be done with a theme option or plugin was providing some context to the multi-post pages. These are pages like the list of <a href="http://publications.cetis.ac.uk/">all the publications</a>, or <a href="http://publications.cetis.ac.uk/c/briefings">all the briefing papers</a>, and we wanted a heading and some text to explain what that particular cut of our collection was. Some themes do this by default, based on information entered about the tag/catergory/author on which the cut is made, but not Atahualpa. I put a few lines of PHP into the theme&#8217;s index.php template to deal with publication types, but we&#8217;ve yet to do it properly for all possible multipost pages.</p>
<p><strong>And in the end&#8230;</strong><br />
As I said at the top, we&#8217;re happy with this approach; if you have any comment on it, do please leave them below.</p>
<p>One last thing. Using a popular platform like WordPress means that there is a lot of support, and I don&#8217;t just mean a well supported code base and directory of plugins and themes. One of the most useful sources of support has been the WordPress community, especially the <a href="http://www.wpscotland.org/">local group</a> of <a href="http://wpuk.org/">WPUK</a>, at whose meet-ups I get burritos and advice on themes, plugins, security and all things wordpressy.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2012/03/13/cetis-publications-now-on-wordpress/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A reflection for open education week</title>
		<link>http://blogs.cetis.ac.uk/philb/2012/03/08/for-open-education-week/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2012/03/08/for-open-education-week/#comments</comments>
		<pubDate>Thu, 08 Mar 2012 11:28:50 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[oer]]></category>

		<category><![CDATA[ukoer]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=574</guid>
		<description><![CDATA[It&#8217;s open education week, lots of interesting events are happening and lots of reflections being made on what open education means. One set of reflections that caught my eye was a trio of posts from Jisc programme managers David, Amber and Lawrie: three personal attempts to draw a picture of the open education space to [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s <a href="http://www.openeducationweek.org/">open education week</a>, lots of interesting events are happening and lots of reflections being made on what open education means. One set of reflections that caught my eye was a trio of posts from Jisc programme managers <a href="http://followersoftheapocalyp.se/my-openedspace">David</a>, <a href="http://amberthomas.typepad.com/fragments/2012/03/openedspace.html">Amber</a> and <a href="http://lawrie.jiscinvolve.org/wp/2012/03/05/my-openedspace/">Lawrie</a>: three personal attempts to draw a picture of the open education space to answer the question &#8220;what is open education and how does it fit in with everything else?&#8221;. These sprung from an attempt &#8220;to describe the way JISC-funded work is contributing to developing this space&#8221;. They are great. But I think they miss one thing: the time dimension. By a stroke of good luck, Lou Macgill has recently produced an <a href="http://blogs.cetis.ac.uk/othervoices/2012/02/13/open-educational-resources-timeline/">OER Timeline</a> which I think represents this very nicely. (Yes, I know that there is much more to education than resources, and much more open education than OER, but it&#8217;s resource management and dissemination that I mostly work on.)</p>
<p>Maybe it&#8217;s a sign of age, but the changes in approaches to supporting the sharing of content is something that has been interesting me more and more of late.  Nearly two years ago Lorna, John and I produced a paper for the ADL Repositories and Registries Summit called <a href="http://blogs.cetis.ac.uk/lmc/2010/04/16/then-and-now/">Then and Now</a> which highlighted changes in technical approaches to JISC programmes that CETIS had helped support between 2002 and 2010. The desire to share resources had always been there, the change was from a focus on tight technical specifications to one which put openness at the centre. This wasn&#8217;t done for any <a href="http://blogs.cetis.ac.uk/adam/2010/09/15/openness-ideology-or-rationality/">ideological reason</a>, but because we had an aim, &#8220;share stuff&#8221;, and the open approach seemed the one that presents fewest obstacles. I tried to describe the advantages of the open approach in <a href="http://blogs.cetis.ac.uk/philb/2010/09/10/open-closed-case/">An open and closed case for educational resources</a>. </p>
<p>The timeline helps me understand why we are doing OER rather than some other means of solving the problem of how to share content, but that is just one aspect. What I really like about the open approach is that it creates new possibilities as well as solving old problems. So as well as a timeline of solutions what we should have is a timeline that shows what we are trying to do, one which shows the changing aims as well as the changing solutions, and that I think would show a trend to Open Education.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2012/03/08/for-open-education-week/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Badges at the CETIS conference 2012</title>
		<link>http://blogs.cetis.ac.uk/philb/2012/02/29/badges/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2012/02/29/badges/#comments</comments>
		<pubDate>Wed, 29 Feb 2012 14:58:19 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[competences]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=559</guid>
		<description><![CDATA[Mozilla open badges that is. 
Simon and I organised a session &#8220;Are open badges the future for recognition of skills?&#8221; for the CETIS conference last week, with more than a little help from Doug Belshaw. As described in more detail on the session&#8217;s wiki page, the programme was simple: presentations from Doug and Simon followed [...]]]></description>
			<content:encoded><![CDATA[<p><a href="https://wiki.mozilla.org/Badges">Mozilla open badges</a> that is. </p>
<p>Simon and I organised a session &#8220;<a href="http://wiki.cetis.ac.uk/Are_open_badges_the_future_for_recognition_of_skills%3F">Are open badges the future for recognition of skills?</a>&#8221; for the CETIS conference last week, with more than a little help from <a href="http://about.me/dajbelshaw">Doug Belshaw</a>. As described in more detail on <a href="http://wiki.cetis.ac.uk/Are_open_badges_the_future_for_recognition_of_skills%3F">the session&#8217;s wiki page</a>, the programme was simple: presentations from Doug and Simon followed by discussion structured around a SWOT analysis for the use of badges in two scenarios.</p>
<p><a href="http://dajbconf.posterous.com/jisc-cetis-conference-2012">Doug&#8217;s conference blog</a> has his slides, audio recording and his own reflections.  One of the highlights for me was almost incidental to badges: I hadn&#8217;t come across the idea of &#8220;stealth assessment&#8221; before. Simply put, stealth assessment involves monitoring what people achieve and then telling them what it qualifies them for. So a young child might be told that they have just swum 10m and now qualify for an achievement badge (kinder than putting children through the stress of pre-arranged assessments).</p>
<p>If Doug&#8217;s presentation was about &#8220;why?&#8221; Simon&#8217;s was about &#8220;how?&#8221;. His presented <a href="http://www.slideshare.net/asimong/some-requirements-for-badge-systems">some requirements for badge systems</a>, which also considers how close the Mozilla open badge framework comes to fulfilling these requirements.</p>
<p>The second half of the session was spent in group discussion structured around a SWOT analysis of two scenarios outlined by the groups:</p>
<p><strong>Scenario 1, formative assessment in a high stakes field (medical education)</strong></p>
<p><strong>Strengths:</strong></p>
<ul>
<li>Assessment can be continuous, accreditation expires if not renewed.</li>
<li>Badges are machine processable as well as human readable.</li>
<li>Cumulative, can show progress being made towards degree</li>
</ul>
<p><strong>Weaknesses:</strong></p>
<ul>
<li>New, and therefore not trusted</li>
</ul>
<p><strong>Opportunities:</strong></p>
<ul>
<li>Works well with highly competitive students (e.g. medics)</li>
<li>Could be transferred between institutions</li>
</ul>
<p><strong>Threats:</strong></p>
<ul>
<li>Perception of being trivial</li>
<li>Unwelcome addition to current systems</li>
</ul>
<p><strong>Scenario 2, within a community of practice</strong></p>
<p><strong>Strengths:</strong></p>
<ul>
<li>Recognition by community of practice</li>
<li>Transferability to other communities</li>
</ul>
<p><strong>Weaknesses:</strong></p>
<ul>
<li>Lack of context (range, scope etc.)</li>
<li>People unwilling to dig into detail provided</li>
<li>Unclear governance</li>
<li>Proliferation</li>
</ul>
<p><strong>Opportunities:</strong></p>
<ul>
<li>Currency outside the community</li>
<li>Could include qualifications</li>
<li>Branding opportunities</li>
<li>Invitation to examine evidence in detail</li>
</ul>
<p><strong>Threats</strong></p>
<ul>
<li>Over simplification</li>
<li>Brand recognition dominates quality</li>
<li>Issued by inappropriate bodies</li>
</ul>
<p>There was a lot of discussion, and I can&#8217;t really do it justice here; I shall mention only a couple of comments. First one reported on Doug&#8217;s blog:</p>
<blockquote><p>&#8220;We&#8217;re sick to death of hearing that X, Y or Z is going to change the world. Accept that it isn&#8217;t and move on.&#8221;</p></blockquote>
<p>Hmm, yes, that may be a fair point. OTOH sometimes something does change the world, or at least parts of it, and it is important to keep a watch for the things that might, so I&#8217;ve no regrets about being involved in this session on that count.</p>
<p>The other comment came in the form of of tweets from <a href="https://twitter.com/#!/lawrie">Lawrie Phipps</a>:</p>
<p><img src="http://blogs.cetis.ac.uk/philb/files/2012/02/lawrietweet1.png" alt="Edging toward believing that &quot;badges&quot; may be a solution to a problem we&#39;ve almost fixed with other things #cetis12" width="509" height="74" class="size-full wp-image-565" /></p>
<p><img src="http://blogs.cetis.ac.uk/philb/files/2012/02/lawrietweet2.png" alt="thinking about open badges, surely once they&#39;re accepted by everyone, they bcome institutionalised, and look like something we have now?" width="513" height="86" class="size-full wp-image-564" /></p>
<p>Sentiments that I have some sympathy with, however it has happened that you think you have <a href="http://en.wikipedia.org/wiki/Coloured_Book_protocols">solved a problem locally</a> only for  some external solution to come along and get adopted widely enough to be significant. So, while we don&#8217;t have an answer to the question we posed in the session title, I think open badges are looking relevant enough for it to be important that CETIS keep a watching brief on them.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2012/02/29/badges/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A lesson in tagging for UKOER</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/11/06/a-lesson-in-tagging-for-ukoer/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/11/06/a-lesson-in-tagging-for-ukoer/#comments</comments>
		<pubDate>Sun, 06 Nov 2011 19:38:30 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[oer]]></category>

		<category><![CDATA[resource description]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=556</guid>
		<description><![CDATA[We&#8217;ve been encouraging projects in the HE Academy / JISC OER programmes to use  platforms that help get the resources out onto the open web and into the places where people look, rather than expecting people to come to them. YouTube is one such place. However, we also wanted to be able to find [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve been encouraging projects in the HE Academy / JISC OER programmes to use  platforms that help get the resources out onto the open web and into the places where people look, rather than expecting people to come to them. YouTube is one such place. However, we also wanted to be able to find all the outputs from the various projects wherever they had been put, without relying on a central registry, so one of the technical recommendations for the programme was that resources are tagged UKOER.</p>
<p>So, I went to YouTube and <a href="http://www.youtube.com/results?search_query=ukoer&amp;aq=f">searched for UKOER</a>, and <a href="http://www.youtube.com/watch?v=0vpc87yIXlg">this</a> was the top hit. Well, it&#8217;s a lesson in tagging I suppose. I don&#8217;t think it invalidates the approach, we never expected 100% fidelity and this seems to be a one-off among the first 100 or so of the 500+ results.  And it&#8217;s great to see results from Chemistry.FM and CoreMaterials topping 10,000 views.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/11/06/a-lesson-in-tagging-for-ukoer/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Text and Data Mining workshop, London 21 Oct 2011</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/10/21/text-and-data-mining-workshop-london-21-oct-2011/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/10/21/text-and-data-mining-workshop-london-21-oct-2011/#comments</comments>
		<pubDate>Fri, 21 Oct 2011 15:06:42 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[achievement information]]></category>

		<category><![CDATA[aggregated content]]></category>

		<category><![CDATA[architecture]]></category>

		<category><![CDATA[educational content]]></category>

		<category><![CDATA[oer]]></category>

		<category><![CDATA[open content]]></category>

		<category><![CDATA[resource description]]></category>

		<category><![CDATA[semantic technologies]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=552</guid>
		<description><![CDATA[ There were two themes running through this workshop organised by the Strategic Content Alliance: technical potential and legal barriers. An important piece of background is the Hargreaves report.
The potential of text and data mining is probably well understood in technical circles, and were well articulated by JohnMcNaught of NaCTeM. Briefly the potential lies in [...]]]></description>
			<content:encoded><![CDATA[<p> There were two themes running through this workshop organised by the Strategic Content Alliance: technical potential and legal barriers. An important piece of background is the <a href="http://www.ipo.gov.uk/ipreview.htm">Hargreaves report</a>.</p>
<p>The potential of text and data mining is probably well understood in technical circles, and were well articulated by JohnMcNaught of NaCTeM. Briefly the potential lies in the extraction of new knowledge from old through the ability to surface implicit knowledge and show semantic relationships. This is something that could not be done by humans, not even crowds, because of volume of information involved. Full text access is crucial, John cited a finding that only 7% of the subject information extracted from research papers was mentioned in the abstract. There was a strong emphasis, from for example Jeff Lynn of the Coalition for a digital economy and Philip Ditchfield of GSK, on the need for business and enterprise to be able to realise this potential if it were to remain competetive.</p>
<p>While these speakers touched on the legal barriers it was Naomi Korn who gave them a full airing. They start in the process of publishing (or before) when publishers acquire copyright, or a licence to publish with enough restriction to be equivalent. The problem  is that the first step of text mining is to make a copy of the work in a suitable format. Even for works licensed under the most liberal open access licence academic authors are likely to use, CC-by, this requires attribution. Naomi spoke of attribution stacking, a problem John had mentioned when a result is found by mining 1000s of papers: do you  have to attribute all of them? This sort of problem occurs at every step of the text mining process. In UK law there are no copyright exceptions that can apply: it is not covered by fair dealling (though it is covered by fair use in the US and similar exceptions in Norwegian and Japanese law, nowhere else); the exceptions for transient copies (such as in a computers memory when readng on line) only apply if that copy has not intrinsic value.</p>
<p>The Hargreaves report seeks to redress this situation. Copyright and other IP law is meant to promote innovation not stifle it, and copyright is meant to cover creative expressions, not the sort of raw factual information that data mining processes. Ben White of the British Library suggested an extension of fair dealling to permit data mining of legally obtained publications. The important thing is that, as parliament acts on the Hargreaves review people who understand text mining and care about legal issues make sure that any legislation is sufficient to allow innovation, otherwise innovators will have to move to those  jurisdictions like the US, Japan and Norway where the legal barriers are lower (I&#8217;ll call them &#8216;text havens&#8217;). </p>
<p>Thanks to JISC and the SCA on organising this event; there&#8217;s obviously plenty more for them to do.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/10/21/text-and-data-mining-workshop-london-21-oct-2011/feed/</wfw:commentRss>
		</item>
		<item>
		<title>LRMI: after the meeting</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/09/15/lrmi-after-the-meeting/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/09/15/lrmi-after-the-meeting/#comments</comments>
		<pubDate>Thu, 15 Sep 2011 07:35:38 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[CETIS-Content]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=542</guid>
		<description><![CDATA[Last week I was at the first face to face meeting of the Learning Resource Metadata Initiative technical working group, here are my reflections on it. In short, what I said in previous post was about right, and the discussion went the way I hoped. One addition, though, that I didn&#8217;t cover in that post, [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I was at the first face to face meeting of the <a href="http://wiki.creativecommons.org/LRMI">Learning Resource Metadata Initiative</a> technical working group, here are my reflections on it. In short, <a href="http://blogs.cetis.ac.uk/philb/2011/09/08/learning-resource-metadata-initiative/">what I said in previous post</a> was about right, and the discussion went the way I hoped. One addition, though, that I didn&#8217;t cover in that post, was some discussion of accessibility conditions. That was one of a number of issues that was set aside as being of more general importance than learning resources and best dealt with that wider scope in mind; the resources of the LRMI project being better spent on those issues that are specific to learning materials.</p>
<p>An interesting take on the scope of the project that someone (I forget who) raised during the meeting concerns working within the constraints of the search engine interface and results page. Yes, Google, Bing and Yahoo have advanced search interfaces that allow check-box selection of conditions such as licence requirements, they also provide and support for specialist search e.g. Google Scholar and custom search engines; however real success will come if the information that can be marked up as a result of LRMI is effective for people using the default search engine. What this means is that the actions that result from a use case or scenario should be condensed down to a few key words typed into a search box,<br />
<div id="attachment_545" class="wp-caption aligncenter" style="width: 310px"><a href="http://blogs.cetis.ac.uk/philb/files/2011/09/bingsearch.png"><img src="http://blogs.cetis.ac.uk/philb/files/2011/09/bingsearch-300x71.png" alt="Bing search box" width="300" height="71" class="size-medium wp-image-545" /></a><p class="wp-caption-text">Bing search box</p></div><br />
and the information displayed as an outcome should fit into an inch or two of screen space on a search engine results page.<br />
<div id="attachment_549" class="wp-caption aligncenter" style="width: 310px"><a href="http://blogs.cetis.ac.uk/philb/files/2011/09/bingresult.png"><img src="http://blogs.cetis.ac.uk/philb/files/2011/09/bingresult-300x60.png" alt="Bing search result" width="300" height="60" class="size-medium wp-image-549" /></a><p class="wp-caption-text">Bing search result</p></div> </p>
<p>That&#8217;s quite useful in terms of focussing on what is really important, but of course it won&#8217;t meet everyone&#8217;s ambitions for learning resource metadata. The question this raises is to what extent should the schema.org vocabulary attempt to meet these requirements? That, I think, is still an open question, but I am sure that embedded metadata markup such as schema.org has limitations and external metadata such as is provided by the IEEE LOM, Dublin Core and ISO MLR is a complemetary approach that may be necessary in meeting some of the more extensive use cases for learning resource metadata. Indeed, one requirement of LRMI which was raised during the meeting is to provide a means of linking to external metadata. One more observation on this line: at least from the basis of this meeting, it seems that the penetration of standards for educational metadata into the commercial educational publishing world (both online and more conventional) is not great.</p>
<p>A final issue concerning the scope of LRMI, and schema.org more generally, with respect to the use of other approaches to handling metadata is relevant to the idea of linking to external metadata, but is better illustrated by the issue of how to convey licence information. At the moment there is no schema.org term for indicating licence terms, however there is a perfectly good approach advocated by Creative Commons and recognised by Google and many other search and content providers (i.e. a link or anchor with attributes rel=&#8221;license&#8221; href=&#8221;licenceURL&#8221; optionally spanning a textual description of the licence&#8211;no prizes for guessing how I think this could be extended to links to external metadata). Is it helpful to reproduce this in schema.org? On the one hand, one of the aims of schema.org is to offer web masters a unified approach and a single coherent set of recommendations for embedding metadata; on the other hand this approach seems to be in accord with HTML in general and is aready widespread, so perhaps any clarification or coherence in terms of the schema.org offering would be at the expense of muddying and fragmentation of practice with respect to how to embed licence information in HTML more generally.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/09/15/lrmi-after-the-meeting/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Learning resource metadata initiative</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/09/08/learning-resource-metadata-initiative/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/09/08/learning-resource-metadata-initiative/#comments</comments>
		<pubDate>Thu, 08 Sep 2011 03:40:49 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[metadata]]></category>

		<category><![CDATA[oer]]></category>

		<category><![CDATA[resource description]]></category>

		<category><![CDATA[ukoer]]></category>

		<category><![CDATA[lrmi]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=535</guid>
		<description><![CDATA[In the spirit of Godwin&#8217;s law, I would propose that
&#8220;As any discussion about metadata grows longer the probability of a comparison to Google approaches one.&#8221;
Of course the comparison is usually that formal metadata is insignificant for the resource discovery needs of most people when compared to Google.
On one hand this is an over simplification: metadata [...]]]></description>
			<content:encoded><![CDATA[<p>In the spirit of <a href="http://en.m.wikipedia.org/wiki/Godwin%27s_law">Godwin&#8217;s law</a>, I would propose that</p>
<blockquote><p>&#8220;As any discussion about metadata grows longer the probability of a comparison to Google approaches one.&#8221;</p></blockquote>
<p>Of course the comparison is usually that formal metadata is insignificant for the resource discovery needs of most people when compared to Google.</p>
<p>On one hand this is an over simplification: metadata is important for resource management in general not just for resource discovery, the information contained in metadata can be exposed to Google and other search engines, and it helps resource discovery in other ways, for example in displaying relationships between resources that can be browsed and crawled. It remains, however, true that all the effort that has gone into formalising and standardising metadata schema has had little, if any, direct effect on how people find resources through the search engine of their choice. So it&#8217;s interesting that the big search engines are now taking an interest in metadata markup of web pages, first with Google&#8217;s rich snippets, and now the more extensive (in a number of ways) <a href="http://schema.org/">schema.org</a> initiative.  I guess that this approach (that is, marking up the human readable infomation on a web page to show its relationship to a formal metadata schema as opposed to holding it seperately in a purely machine readable format) appeals to search engines because of their suspicion that any information not visible to the reader of a page (e.g. metadata elements in the HTML head element) might be there purely to spam search engine results.</p>
<p>Of course, my interest through CETIS is in educational metadata, and I have already <a href="http://blogs.cetis.ac.uk/philb/2011/03/30/rich-snippets/">dabbled in using rich snippets</a> to mark-up a description of an educational resource. So I was extremely interested to hear about the <a href="http://wiki.creativecommons.org/LRMI">Learning Resource Metadata Initiative</a> headed up by Creative Commons and the Association of Educational Publishers, aiming to apply the schema.org approach to educational resources (schema.org initially, with an RDFa expression planned as a secondary output derived from it). I was extremely pleased to be accepted on to the <a href="http://creativecommons.org/weblog/entry/28719">technical working group</a> to help draw up the details. Tomorrow is the first face to face meeting of that technical group, which is why I writing this on a plane on the way to San Fransisco.</p>
<p>While this will be the first face to face meeting, the technical group has made a start on its work. The previous work in educational metadata has been surveyed; use cases for lrmi have been collected, including those which were submitted for the Dublin Core Education Application Profile; and we&#8217;ve had a couple of teleconference meetings. It&#8217;s early days, so a lot is still open, but this much I can say (but I say it as an individual, I&#8217;m not claiming to be reporting any consensus of the working group). The scope of lrmi is resource discovery, and for me it stands or falls on whether it helps discovery through search engines. With respect to this there does already seem to be some uncertainty (generally) over how search engines will use schema.org and how the governance of the main schema.org vocabulary allows for community-driven additions and usage profiles (there is an <a href="http://www.bing.com/community/site_blogs/b/webmaster/archive/2011/07/25/schema-org-workshop-announced-sept-21-2011.aspx">upcoming schema.org meeting</a> that might help clarify this). However, I guess that in the end it will come down to Google and others using what they find useful and ignoring what that don&#8217;t: which isn&#8217;t a bad way of establishing an industry standard in this field (I see parallels with browser developers and HTML5). The use cases gathered include the usual discovery issues, so far I haven&#8217;t seen anything unexpected, so hopefully the lrmi output will align with other efforts to meet those same scenarios. There is one slight coda to that though, there is a lot of interest in expressing the usefulness of a resource for specific learning objectives as set out in standard curricula. This is largely with respect to showing the alignment of a resource with US state standard curricula, and the US national core K-12 curriculum. I know very little about the US standard curriculum/a, but I do think it is important that (and believe it would be useful) any approach adopted by lrmi to showing this alignment should be usable more generally for, e.g., the English National Curriculum and possibly for wider competency frameworks as used in UK HE for some disciplines (e.g. medecine, Scottish law, engineering). I should stress that, while the level of interest in this is noteworthy, showing such alignments isn&#8217;t new: it&#8217;s achievable with the LOM (classification with purpose set to learning objective), Dublin Core has had the conformsTo term for showing alignment to an educational standard for a number of years, and it has been discussed for the conceptual model for ISO MLR part 5.</p>
<p>I&#8217;ll report more when I am home from the meeting and will, of course, be happy to feed forward any comments you have, but to be kept up to date on all developments and to have a more direct say join the <a href="//groups.google.com/group/lrmi">LRMI discussion group</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/09/08/learning-resource-metadata-initiative/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Testing Caprét</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/08/17/testing-capret/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/08/17/testing-capret/#comments</comments>
		<pubDate>Wed, 17 Aug 2011 12:07:00 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[oer]]></category>

		<category><![CDATA[resource description]]></category>

		<category><![CDATA[semantic technologies]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=521</guid>
		<description><![CDATA[I&#8217;ve been testing the alpha release of CaPRéT , a tool that aids attribution and tracking of openly licensed content from web sites. According to the Caprét website.
When a user cuts and pastes text from a CaPRéT-enabled site:

The user gets the text as originally cut, and if their application supports the pasted text will also [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been testing the alpha release of <a href="http://capret.mitoeit.org/">CaPRéT </a>, a tool that aids attribution and tracking of openly licensed content from web sites. According to the Caprét website.</p>
<blockquote><p>When a user cuts and pastes text from a CaPRéT-enabled site:</p>
<ul>
<li>The user gets the text as originally cut, and if their application supports the pasted text will also automatically include attribution and licensing information.</li>
<li>    The OER site can also track what text was cut, allowing them to better understand how users are using their site.</li>
</ul>
</blockquote>
<p>I tested Caprét  on a single page, <a href="http://www.icbl.hw.ac.uk/~philb/">my institutional home page</a> and on this blog. To enable Caprét for material on a website you need to include links to four javascript files in your webpages. I went with the files hosted on the Caprét site so all I had to do was put this into my homepage&#8217;s <code>&lt;head&gt;</code> (The testing on my home page is easier to describe, since the options for WordPress will depend on the theme you have installed.)</p>
<p><code><br />
    &lt;script src="https://ajax.googleapis.com/ajax/libs/jquery/1.6.2/jquery.min.js" type="text/javascript"&gt;&lt;/script&gt;<br />
    &lt;script src="http://capret.mitoeit.org/js/jquery.plugin.clipboard.js" type="text/javascript"&gt;&lt;/script&gt;<br />
    &lt;script src="http://capret.mitoeit.org/js/oer_license_parser.js" type="text/javascript"&gt;&lt;/script&gt;<br />
    &lt;script src="http://capret.mitoeit.org/js/capret.js" type="text/javascript"&gt;&lt;/script&gt;<br />
</code></p>
<p>Then you need to put the relevant information, properly marked up into the webpage. Currently caprét cites the Title, source URL, Author, and Licence URI of the page from which the text was copied. The easiest way to get this information into your page is to use a platform which generates it automatically, e.g. WordPress or Drupal with the <a href="http://openattribute.com/">OpenAttribute plug-in</a> installed. The next easiest way is to fill out the form at the <a href="http://creativecommons.org/choose/">Creative Commons License generator</a>. Be sure to supply the additional information if you use that form.</p>
<p>If you&#8217;re into manual, this is what does the work:</p>
<p><strong>Title</strong>, is picked up from any text marked as<br />
<code>&lt;span xmlns:dct="http://purl.org/dc/terms/" href="http://purl.org/dc/dcmitype/Text" property="dct:title" rel="dct:type"&gt;&lt;span&gt;</code> or, if that&#8217;s not found, the page <code>&lt;title&gt;</code> in the <code>&lt;head&gt;</code></p>
<p><strong>Source URL</strong> comes from page url</p>
<p><strong>Author name</strong>, is picked up from contents of <code>&lt;a xmlns:cc="http://creativecommons.org/ns#" href="http://jisc.cetis.ac.uk/contact/philb" property="cc:attributionName" rel="cc:attributionURL"&gt;&lt;/a&gt;</code> (actually, the author attribution URL in the href attribute isn&#8217;t currently used, so this could just as well be a span)</p>
<p><strong>Licence URI</strong>, is picked up from the href attribute of <code>&lt;a rel="license" href="http://creativecommons.org/licenses/by/3.0/"&gt;</code></p>
<p>You might want to suggest other things that could be in the attribution/citation.</p>
<p><strong>Reflections</strong><br />
As far as attribution goes it seems to work. Copy something from my home page or this blog and paste it elsewhere and the attribution information should magically appear.  What&#8217;s also there is an <a href="http://wiki.cetis.ac.uk/Web_bugs">embedded tracking gif</a>, but I haven&#8217;t tested whether that is working.</p>
<p>What I like about this approach is that it converts self-description into embedded metadata. Self description is practice of including within a resource that information which is important for describing it: the title, author, date etc. Putting this information into the resource isn&#8217;t rocket science, <a href="http://blogs.cetis.ac.uk/johnr/2008/12/08/open-educational-resources-metadata-and-self-description/">it&#8217;s just good practice</a>. To convert this <a href="http://blogs.cetis.ac.uk/philb/2009/09/14/about-metadata-resource-description-pt-1/">information into metadata</a> it needs to be encoded in such a way that a machine can read it. That&#8217;s where the RDFa comes in. What I like about RDFa (and microformats and microdata) as a way of publishing metadata is that it builds the actual descriptions are the very same ones that it&#8217;s just good practice to include in the resource. Having them on view in the resource is likely to help with quality assurance, and, while the markup is fiddly (and best dealt with by the content management system in use, not created by hand) creating the metadata should be no extra effort over what you should do anyway. </p>
<p>Caprét is being developed by MIT OEIT and Tatemae (OERGlue) as part of the <a href="http://blogs.cetis.ac.uk/lmc/2011/05/09/jisc-cetis-oer-technical-mini-projects-results/">JISC CETIS mini projects initiative</a>; it builds on the browser plug-in developed independently by the <a href="http://openattribute.com/">OpenAttribute</a> team.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/08/17/testing-capret/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Amazon kindle and textbooks</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/08/10/amazon-kindle-and-textbooks/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/08/10/amazon-kindle-and-textbooks/#comments</comments>
		<pubDate>Wed, 10 Aug 2011 09:24:13 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[educational content]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=517</guid>
		<description><![CDATA[Amazon are renting textbooks for the kindle. Over the last couple of months I&#8217;ve been using a Kindle. We bought it with the idea of seeing how it might be useful for educational content, eTextbooks at the most basic level, though I&#8217;ve already written about my misgivings on that score. Well, we quickly came to [...]]]></description>
			<content:encoded><![CDATA[<p>Amazon are <a href="http://www.amazon.com/gp/feature.html/?docId=1000702481">renting textbooks</a> for the kindle. Over the last couple of months I&#8217;ve been using a Kindle. We bought it with the idea of seeing how it might be useful for educational content, eTextbooks at the most basic level, though I&#8217;ve already written about <a href="http://blogs.cetis.ac.uk/philb/2011/01/24/hopes-and-fears-for-ereaders-and-etextbooks/">my misgivings on that score</a>. Well, we quickly came to the conclusion that the Kindle device wasn&#8217;t much good for eTextbooks: no colour, screen refresh too slow for dynamic content, not good for non-linear content (breakout boxes, footnotes, even multiple columns)&#8211;sure it displays pdfs and HTML, but it&#8217;s difficult to get difficult to get a magnification that works well and navigating around the page is clunky, and it doesn&#8217;t do ePub. But it&#8217;s fine for novels and there may be some educational utility in the bookmarking, note making and the sharing of these that is possible (though making notes using the kindle isn&#8217;t a great user experience). Anyway, given all that it&#8217;s interesting to note that the textbooks shown in the Amazon ad I mention above rely on colour, are non-linear, and both of them would be really engaging if the diagrams were interactive or even just animated. Neither of them are being displayed on a Kindle.</p>
<p>[Aside: textbook rental might be an attractive idea for some, but pricing based on a typical rental of 30 days!?]</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/08/10/amazon-kindle-and-textbooks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The hunting of the OER</title>
		<link>http://blogs.cetis.ac.uk/philb/2011/05/27/the-hunting-of-the-oer/</link>
		<comments>http://blogs.cetis.ac.uk/philb/2011/05/27/the-hunting-of-the-oer/#comments</comments>
		<pubDate>Fri, 27 May 2011 13:38:15 +0000</pubDate>
		<dc:creator>philb</dc:creator>
		
		<category><![CDATA[oer]]></category>

		<category><![CDATA[repositories]]></category>

		<category><![CDATA[ukoer]]></category>

		<guid isPermaLink="false">http://blogs.cetis.ac.uk/philb/?p=509</guid>
		<description><![CDATA[&#8220;As internet resources are being moved, they can no longer be traced.&#8221; I read in a press release from Knowledge Exchange. This struck me as important for OERs since part of their &#8220;openness&#8221; is the licence to copy them, and I have recently been on something of an OER hunt, which highlights the importance of [...]]]></description>
			<content:encoded><![CDATA[<p>&#8220;As internet resources are being moved, they can no longer be traced.&#8221; I read in a <a href="http://www.knowledge-exchange.info/Default.aspx?ID=62&amp;M=News&amp;PID=2379&amp;NewsID=114">press release</a> from <a href="http://www.knowledge-exchange.info/">Knowledge Exchange</a>. This struck me as important for OERs since part of their &#8220;openness&#8221; is the licence to copy them, and I have recently been on something of an OER hunt, which highlights the importance of using identifiers correctly and of &#8220;curatorial responsibility&#8221;.</p>
<p>The OER I was hunting was an &#8220;Interactive timeline on Anglo-Dutch relations (50 BC to 1830)&#8221; from the UKOER <a href="http://www.ucl.ac.uk/dutch/OER">Open Dutch project</a>. It was recommended at a year or so ago as great output which pretty much anyone could see the utility of that used the MIT SIMILE timeline software to create a really engaging interface. I liked it, but more importantly for what I&#8217;m considering now I used it as an example when investigating whether putting resources into a repository enhanced their visibility on Google (in this case it did).</p>
<p>Well, that was a year+ ago. The other week I wanted to find it again. So I went to <a href="http://www.google.com/search?q=anglo+dutch+timeline&amp;gl=uk">Google and searched for &#8220;anglo dutch timeline&#8221;</a> (without the quotes). Sure enough, I got three results for the one I am looking for on the first page (of course, your results my vary; Google&#8217;s like that now-a-days). These were, from the bottom up:</p>
<ol>
<li>A link to <a href="https://dspace.ndlr.ie/jspui/handle/10633/16285">a record in the NDLR</a> (the Irish National Digital Learning Resources Repository) which gave the link URL as http://open.jorum.ac.uk:80/xmlui/handle/123456789/517 (see below)</li>
<li>A link to <a href="http://humbox.ac.uk/2404/">a resource page in HumBox</a>, which turned out to be a manifest-only content package (i.e. metadata in a zip file). Looking into it, there&#8217;s no resource location given in the metadata, and the pointer to the content (which should be the resource being described) actually points to the Open Dutch home page.</li>
<li>Finally, a link to <a href="http://open.jorum.ac.uk/xmlui/handle/123456789/517">a resource page in JORUM</a>. This also describes the resource I was looking for but actually points to Open Dutch project page. The URL for Jorum page describing the resource is given as the persistent link&#8211;I believe that the NDLR harvests metadata from Jorum, so my guess is that that is why NDLR list this as the location of the resource.</li>
</ol>
<p>Finding descriptions of a resource isn&#8217;t really helpful to many people. OK, I now know the full name and the author of the resource, which might help me track down the resource, but at this point I couldn&#8217;t. Furthermore, nobody wants to find a description of a resource that links to a description of the resource. I think one lesson concerns the <strong>importance of identifiers</strong>: &#8220;describe the thing you identify; identify the thing you describe.&#8221;</p>
<p>This story (and I very much suspect it is not an isolated case) has significance for debates about whether repositories should accept metadata-only &#8220;representations&#8221; of resources. Whether or not it is a good idea to deposit resources you are releasing as OERs in a third-party repository will depend on what you want to achieve by releasing them; whether or not it is a good idea for a repository to take and store resources from third parties will depend on what the repository&#8217;s sponsors wish to facilitate.  Either way, someone needs to take some <strong>curatorial responsibility</strong> for the resource and for the metadata about it. That means on the one hand making sure that the resource stays on the web and on the other hand making sure that the metadata record continues to point to the right resource (automatic link checking for HTTP 404 responses etc. helps but, as this <a href="http://blog.pinboard.in/2011/05/remembrance_of_links_past/">post on link rot</a> notes, it&#8217;s not always that simple).</p>
<p>By the way, thanks to the incomparable David Kernohan, I now know that the timeline is currently at <a href="http://www.ucl.ac.uk/alternative-languages/OER/timeline/">http://www.ucl.ac.uk/alternative-languages/OER/timeline/</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.cetis.ac.uk/philb/2011/05/27/the-hunting-of-the-oer/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>

