D-Lib Magazine
|
|
Ben Lund, Tony Hammond, Martin Flack and Timo Hannay |
IntroductionConnotea [1] is a free online reference management and social bookmarking service for scientists created by Nature Publishing Group [2]. While somewhat experimental in nature, Connotea already has a large and growing number of users, and is a real, fully functioning service [3]. The label 'experimental' is not meant to imply that the service is any way ephemeral or esoteric, rather that the concept of social bookmarking itself and the application of that concept to reference management are both recent developments. Connotea is under active development, and we are still in the process of discovering how people will use it. In addition to Connotea being a free and public service, the core code is freely available under an open source license [4]. Connotea was conceived from the outset as an online, social tool. Seeing the possibilities that del.icio.us [5] was opening up for its users in the area of general web linking, we realised that scholarly reference management was a similar problem space. Connotea was designed and developed late in 2004, and soft-launched at the end of December 2004. Usage has grown over the past several months, to the point where there is now enough data in the system for interesting second-order effects to emerge. This paper will start by giving an overview of Connotea, and will outline the key concepts and describe its main features. We will then take the reader on a brief guided tour, show some of the aforementioned second-order effects, and end with a discussion of Connotea's likely future direction. The Key Concepts behind ConnoteaUnashamedly inspired by del.icio.us as we were, we felt there were some additional characteristics of academic reference lists that warranted a separate, tailored service catering to, and making the most of, the needs of the academic community. Therefore, Connotea is a meld of existing reference management conventions and new social bookmarking concepts that are explored in a companion paper [6]. While none of these concepts is original to Connotea, the combination is. To spell them out: 1. Online storage of references and bookmarksThe current standard practice in personal reference management is to have a database of citations stored locally on a user's own computer, and manipulated using dedicated desktop software. Moving the database online has a number of advantages: it makes the collection available from any web-enabled computer, and it allows easy sharing of personal collections (or subsets of them). Storing references online also opens up some new possibilities direct linking into the literature, for example and is the foundation that supports the other three pillars of Connotea (see below). Of course, online storage has its disadvantages too: the responsiveness and usability of the application, for example, and the fact that the availability of your data is in the hands of others. To alleviate these risks, we therefore intend to allow easy import and export between Connotea and other systems, including desktop reference management tools. 2. Simple, non-hierarchical organisingInstead of placing reference material in folders, or folders within folders, the organisational apparatus of Connotea creates a totally flat, but multi-faceted, space. The data can be viewed from the perspective of tags, or users, or links. The 'tags' of Connotea and other social bookmarking tools often lead commentators to decry the anarchy of unconstrained keywords, but this overlooks the fact that tags are intended first and foremost as a way for individual users to manage their own collections. In this way, they are similar in purpose to folder names in computer file systems. However, instead of creating sub-levels of organisation by nesting folders hierarchically, flat tagging achieves this by assigning multiple tags to each item, each tag being treated equally. This releases the user from some of the constraints of a traditional file system, and a web-based interface allows easy navigation of material organised in this way. 3. Opening the list to othersAt the time of writing, all bookmarks posted to Connotea are visible to all registered users and visitors. This takes the concept of sharing to a new level, but also brings new opportunities. We recognise that some users will want to keep some of what they are reading personal, so private bookmarks will soon be added as a feature, though the default will be to make the bookmark public. The main benefits of openness come not just from the ease with which it allows explicit sharing with friends and colleagues, but from many users storing their bookmarks in the same place. This allows Connotea to automatically discover and present connections between users. For example, if someone else has bookmarked the same things as you, that person's library will be a good candidate for a place to find interesting new content. In addition, shared lists allow more sophisticated collaborative filtering algorithms to make recommendations of the form "people who bookmarked this also bookmarked...". 4. Auto-discovery of bibliographic informationIn the spirit of making it quick and easy to use, Connotea attempts to find and import the bibliographic information for any article or book that is added. In many cases, this information is freely available on the web, so why should a user have to retype it? There is also, of course, the benefit of eliminating typing errors. Along with the use of bookmarklets (see below), automatically adding the bibliographic information reduces the workload of saving a reference to two clicks and a small amount of typing for tag names. The Key Features of ConnoteaWhat do these concepts mean for the day-to-day activities of a practicing researcher and their use of Connotea? Here we describe the main features of Connotea that implement the concepts discussed above. 1. BookmarkletsFirstly, users need to be able to quickly and easily switch to their bookmark manager while in the middle of another task. For this purpose, Connotea offers bookmarklets snippets of JavaScript code that can be saved in a browser to provide custom functionality [n1]. The most important of these is a bookmarklet that allows a user to add to Connotea the web page they are currently viewing. Figure 1 shows this in action using the Firefox web browser (this should work in other common browsers too).
The user clicks the bookmarklet, the URL of the page is sent to Connotea, and Connotea presents a form for adding the page to the user's collection. 2. Recognising URLs from common archives and importing bibliographic dataFigure 1 also illustrates the second key feature of Connotea. When a URL is added to Connotea, it is first analysed in order to determine whether it belongs to the set of URLs that Connotea recognises. This can trigger a special behaviour for those web pages that represent academic articles or books bibliographic data for the reference material is collected and added to Connotea. For example, for scholarly articles, Connotea stores the publication name, volume and issue numbers, publication date and the list of authors for the article. At the time of writing, Connotea supports four different article archives and websites [n2], and the Amazon websites for books. This bibliographic data collection process is accomplished by means of a plug-in system. Each Connotea plug-in is specific to a particular archive or website, and must be able to follow the steps outlined in Fig. 2. We are happy to accept plug-ins written by third parties; volunteers should either contact the authors for details of the API (application programming interface) or refer to the Connotea source code and accompanying documentation [4]. As an example, we will now walk you through the bibliographic information import process for a particular PubMed article in this case about the discovery of new hominid fossils attributed to the ancient species Ardipithecus ramidus. The abstract resides at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&
First of all, we can recognise this as an article from PubMed by the domain portion of the URL '
3. TaggingOnce the URL has been sent to Connotea and the bibliographic import process has been completed, the user can add personalised information. The most essential information is the list of tags to associate with the article. Tags are the means by which references are organised in Connotea. Suitable tags should therefore be meaningful in the context of that particular article and that user. For this reason, Connotea allows tags to be almost anything (including both single words and phrases). As discussed above, tags can be thought of as a list of categories for the article, or as folder names, albeit without the potential inconvenience of hierarchy and with the bonus of being able to store the article simultaneously in multiple folders. Figure 3 shows a Connotea window for adding an article. The article in question has been identified, and a few suitable tags have been entered. There is also an option to add a personal description of the resource being bookmarked.
4. CommentsThe other noteworthy piece of personal data is the user's comment. Each user can comment any number of times on any bookmark in their library, and comments from different users are combined to display a chronological, and conversational, thread about a resource. Figure 4 shows a list of comments drawn from the contributions of individual users. Connotea also has a bookmarklet (see Fig. 1) for adding a new comment about the current web page. The idea is that when a user is viewing an article that they already have in their Connotea library, they can quickly and easily add a public note about it.
5. RSS
The bookmarks stored in Connotea can be viewed and navigated by user, by tag, or by a combination of users and tags. The tag and user criteria can be combined using Boolean operators, which allows Connotea visitors to view, for example, all bookmarks created by users ' A Guided Tour
In this section we conduct a brief guided tour of Connotea's uses and functions. In the spirit of the web- and article-centric nature of Connotea, we start by bookmarking this very paper. To do this, either click on the ' http://www.connotea.org/add?uri=http://www.dlib.org/dlib/april05/lund/04lund.html
If you are already logged-in, you will be taken to a form for adding this paper. More likely, if you are not a registered user you will need to register for an account (which is required to post items, but not to browse them). This process takes only a minute or two, and when completed you will then be taken to the bookmarking form. The only required task to perform at this stage is to add some tags, the choice of which is, of course, entirely up to you, the user. We chose ' Now that we have this paper in our library, we can use it as a starting point for exploring the rest of the site and the content. Let us find out who else has bookmarked this paper. Go to your library at http://www.connotea.org/library
and click the ' http://www.connotea.org/uri/685b90ae66cfbc3fc8ebeed0a5def571 This page lists the distinct 'bookmarking events' for the paper, so although the paper title appears many times on the page, each item shows a different user who has bookmarked it, and the tags they used to categorise it.
These tags offer another perspective on the content of Connotea. Click on a tag name and a list of articles that have been categorised using that tag is displayed. Optionally, this list can be restricted to articles posted just by one particular user, and so provide a convenient way of navigating one's own collection (or, for that matter, the collection of another person). Tag-filtered views also provide a simple content discovery route: browsing through lists of bookmarks by hopping between co-occurring tags. For example, at the time of writing, a visitor to Connotea viewing articles tagged (by any user) with ' http://www.connotea.org/tag/open%20access If you have been tag-browsing, return to your library at http://www.connotea.org/library where (if you have bookmarked it as described above) you should again see this paper in the list. If any user has commented on the paper, the total number of comments will be displayed and linked below the title and bibliographic information. Clicking on http://www.connotea.org/comments/uri/685b90ae66cfbc3fc8ebeed0a5def571 will take you to the comments list. See Fig. 4 for a similar list. If you are logged in, a form for submitting a new comment appears at the bottom of the list. We encourage readers of this paper to contribute to the discussion there.
The first comment by user '
As described in the companion paper [6], judicious use of tagging can be used to enable online sharing of reference lists for papers (or book chapters or lectures or meetings) and also to facilitate subsequent sharing of new, related links among readers (or students or attendees). One of the tags in the tag list for ' http://www.connotea.org/user/ben/tag/dlib-connotea-refs
(See Fig. 7 for a screenshot of a similar page.) This is a 'closed' list intended to contain only links to the references used in this paper. The closed nature of the list is enforced by selecting only articles that have been bookmarked with the tag '
The materials in the reference list are also tagged with ' http://www.hyperorg.com/blogger/mtarchive/003840.html is a weblog entry relevant to this paper, although it does not appear in the reference list. Clicking on http://www.connotea.org/add?uri=http://www.hyperorg.com/blogger/mtarchive/003840.html
and assigning ' Other readers can then see new items posted to this community list by visiting http://www.connotea.org/tag/dlib-connotea Note that in this case, unlike that above, we have not specified any user in the query, so you will see items that have been posted by all Connotea users. If you prefer to see new items posted only by the author of this paper then go instead to http://www.connotea.org/user/ben/tag/dlib-connotea
Of course, another username or even several usernames can be used in place of ' Second-Order EffectsWhile the primary goal for Connotea is for it to be useful as a personal bookmarking and reference management tool, some other features and uses emerge as a consequence of pooling this information, and also as a result of the way that individuals use Connotea. Emergent properties that have become apparent to us so far include tag convergence, recommendations, and directory creation. Tag ConvergenceThe term 'folksonomy' [9] refers to the vocabulary, or list of terms, that emerges from the overlapping usage of tags between users. When bookmarking a page, users will often choose their tags deliberately to converge with those of other users in order to gain the discovery benefits described above. In addition, Connotea's 'rename tag' feature not only allows individual users to change their mind about the way they organise their personal libraries, it also facilitates the development of shared tags, by allowing users to change their own taxonomies to better fit in with those of other users [n5]. Although still at an early stage of development, we are beginning to see the emergence of a folksonomy in Connotea. Of the 3359 unique tag names in Connotea at the time of writing, 460 (14%) of them are shared by two or more users, 188 (6%) by three or more, and 109 (3%) by four or more. In fact, the distribution follows a classic power law, as shown in Fig. 6.
Implicit RecommendationBy analysing the data voluntarily supplied by users, Connotea is able to offer them a way of finding new and related content. In other words, Connotea is not only a place to store your bookmarks but is also a community-driven recommendation system. The Connotea interface is hyperlink-oriented bookmark titles are linked to the external web pages, user names are linked to their libraries, and tag names are linked to lists of bookmarks to which that tag has been applied. These links provide one-click access to related material. In addition, and wherever possible, Connotea presents a list of related tags. These are tags that have been co-assigned at some point with the tags that are associated with the current list of bookmarks. Note that it is the users themselves who drive this their tagging behaviour is the foundation of the 'related tags' algorithm. Figure 7 shows a real example of this.
In addition to using co-assignment of tags as a relation indicator, Connotea also employs common bookmarking events as a route into related content. Once again, this is entirely user-driven if two users bookmark the same article, then their interests clearly overlap to some degree. The best place to see this behaviour in action is on an individual user's library page. In the right-hand column of that page there is a list of related users clicking on those user names takes you to their libraries. Semi-Automatic Directory PagesPerhaps the most surprising emergent feature of Connotea is its rise to prominence in Google search results for search terms that coincide with certain Connotea tag names. Because all tag names are hyperlinked in Connotea, and because those links lead to pages listing material that has been bookmarked using that tag, Connotea becomes a user-generated directory of relevant and related resources, and one that Google's ranking algorithm often considers authoritative. For example, at the time of writing a search on Google for 'Drosophila trachea' http://www.google.com/search?q=Drosophila+trachea gives the Connotea bookmark list for that tag as the first result. The term 'microflora' and various species names of ancient hominids that have been used as tag names are other good examples of this while not claiming the number one spot, the relevant Connotea pages rank highly. Future Plans and ProspectsThe Connotea we have described here is the one that exists at the time of writing; it is not the same Connotea that will be available in a few days, weeks or months hence. Indeed, one of the major benefits web-based applications have over their desktop counterparts is that it is relatively quick and easy to roll out new functionality and fixes. Therefore updates can be very frequent sometimes even occurring daily. In this spirit we have a number of new features planned for release in the near future. Meeting the NeedsAs alluded to in our overview of Connotea, we plan to add the ability to keep certain bookmarks private viewable only by the user who posted them. While this removes certain benefits to others of bookmark sharing, we hope that it will also encourage some users to participate who might not otherwise use the system at all, and that those users will make at least some of their bookmarks public. Such users can, of course, still benefit from other users' public bookmarks. We also plan to expand the number of sites whose URLs Connotea recognises. As Connotea understands more, it will retro-actively apply this knowledge to links that have been already posted, turning what were once simple bookmarks into full references. In addition, we are adding RIS and BibTeX import and export functionality to enable researchers to more closely co-ordinate their Connotea collection with their desktop applications and pre-existing reference stores. As well as integrating with user's existing reference stores, it is important for Connotea to be able to integrate with the way they gain access to scholarly information. For this reason, we plan to support OpenURL linking from Connotea. Another plan is to enable user groups. This would allow, for example, a research team to manage their reference lists collaboratively and selectively view recommendations generated only from within the team. Above all, in deploying these and other future developments, we will be listening to feedback from Connotea users. Our priorities remain flexible and we are open to suggestions for other features that users would like to see. The Open Source OpportunityIn addition to these development plans, we have also released Connotea's code under the open-source GNU General Public License [11]. The code, and related documentation, is available from http://connotea.sourceforge.net/ Some might ask why we would consider offering potential competitors a head start by releasing our code in this manner. There are a number of reasons. One is the fact that we have already received several requests for the code from individuals and organisations whose feedback we would value highly. For example, one early adopter, Los Alamos National Laboratory, Research Library, is already experimenting with the Connotea code for possible deployment as an internal application. We invite others to explore similar possibilities and, where possible, supply us with feedback so that we can improve the publicly available implementation and code base. In addition, opening up the source code will, we hope, help other communities, also non-scientific, to set up social bookmarking services of their own. And if any of these groups are able to offer us their own suggestions or improvements, then this will also benefit users of Connotea. Finally, opening up the code allows webmasters to inspect it and to discover exactly how their own URLs are being treated. Those with sufficient motivation and programming expertise might even offer their own improvements, thus benefiting not only themselves but all Connotea users. Similarly, where a suitable plug-in for a website does not yet exist, the openness of the code should help webmasters or other interested developers to contribute one. Thus, by releasing the code we hope to be able to expand the number of sites from which Connotea can extract bibliographic information at a faster rate than we would be able to manage by keeping all development work in-house. But above all, we believe that releasing the code is firmly in keeping with the spirit of Connotea as a piece of community software. It is easier to ask users to submit and share their data if we are willing to publish and share our code. Moreover, the main value of Connotea (and other social bookmarking services) lies not in the technology that runs them, but in the communities that use them. Anything that enhances the Connotea user community whether that is providing them access to the source code or responding to their requests for new features enhances Connotea. AcknowledgementsThe authors would like to thank Herbert Van de Sompel of the Los Alamos National Laboratory, Research Library and Tom Coates of BBC Radio and Music Interactive for reviewing this paper and for providing us with helpful insights and feedback. Notesn1. See the companion paper [6] for a fuller explanation of the use of bookmarklets in social bookmarking applications. n2. The article archives and websites that Connotea currently supports are:
n3. The movie showing the bookmarking process is available in the following formats:
n4. If Connotea has the bibliographic information for an article, the RSS feed includes this. See [8] for more information about publication metadata in RSS. n5. See, for example, Jon Udell's screencast (or video tour) about del.icio.us [10]. References1. Connotea, (application). <http://www.connotea.org/>. 2. Nature Publishing Group. <http://npg.nature.com/>. 3. Lund, B., Scott, J., Hammond, T., and Hannay, T., Connotea: A Free Online Reference Management Service for Scientists, D-Lib Magazine 11(3), March 2005. <http://www.dlib.org/dlib/march05/03inbrief.html#HAMMOND>. 4. Connotea, (code distribution). <http://connotea.sourceforge.net/>. 5. del.icio.us. <http://del.icio.us/>. 6. Hammond, T., Hannay, T., Lund, B., and Scott, J., Social Bookmarking Tools (I): A General Review, D-Lib Magazine 11(4), April 2005. <http://www.dlib.org/dlib/april05/hammond/04hammond.html>. 7. Entrez Programming Utilities. <http://www.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html>. 8. Hammond, T., Hannay, T., and Lund, B., The Role of RSS in Science Publishing: Syndication and Annotation on the Web, D-Lib Magazine 10(12), December 2004. <http://www.dlib.org/dlib/december04/hammond/12hammond.html>. 9. Folksonomy, (definition of). <http://en.wikipedia.org/wiki/Folksonomy>. 10. Udell, J., del.icio.us: the screencast, Jon Udell's Weblog, March 14, 2005. <http://weblog.infoworld.com/udell/2005/03/14.html>. 11. GNU General Public License. <http://www.gnu.org/copyleft/gpl.html>. Copyright © 2005 Ben Lund, Tony Hammond, Martin Flack and Timo Hannay |
|
|
|
Top | Contents | |
| |
D-Lib Magazine Access Terms and Conditions doi:10.1045/april2005-lund
|