<?xml version="1.0" encoding="us-ascii"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article article-type="research-article" xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">LIBER</journal-id>
<journal-title-group>
<journal-title>LIBER QUARTERLY</journal-title>
</journal-title-group>
<issn pub-type="epub">2213-056X</issn>
<publisher>
<publisher-name>Uopen Journals</publisher-name>
<publisher-loc>Utrecht, The Netherlands</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">lq.10221</article-id>
<article-id pub-id-type="doi">10.18352/lq.10221</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Leave the Browser Behind: Placing Discovery Within the User&#x2019;s Workflow</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">http://orcid.org/0000-0003-1611-5833</contrib-id>
<name>
<surname>Cahoy</surname>
<given-names>Ellysa Stern</given-names>
</name>
<email>ellysa@psu.edu</email>
<xref ref-type="aff" rid="aff1"/>
</contrib>
<aff id="aff1">Penn State University, University Park, Pennsylvania, USA</aff>
</contrib-group>
<pub-date pub-type="epub">
<month>1</month>
<year>2018</year>
</pub-date>
<volume>28</volume>
<fpage>1</fpage>
<lpage>19</lpage>
<permissions>
<copyright-statement>Copyright 2018, The copyright of this article remains with the author</copyright-statement>
<copyright-year>2018</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://www.liberquarterly.eu/article/10.18352/lq.10221"/>
<abstract>
<p>As library discovery and delivery has largely moved online, the scholarly workflow of academic researchers has become increasingly fragmented. Faculty are using a variety of software solutions to find, retrieve, organize, annotate, cite, and share information, including library resources. This article discusses the significance of situating discovery, the act of finding new information, within software central to the research workflow, such as citation management software. The current status of discovery of library resources within citation management software is outlined, along with expected future developments. Utilizing findings from two Mellon funded studies, it explores the challenges users face in managing their individual scholarly workflows, and recommends optimizations designed to mitigate information loss and increase researcher productivity.</p>
</abstract>
<kwd-group>
<kwd>Scholarly workflow</kwd>
<kwd>discovery</kwd>
<kwd>citation management software</kwd>
<kwd>libraries</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1. Introduction</title>
<p>Since the mid 1990s, the Web has been a center for information discovery, retrieval, and creation. Seemingly overnight, the Web changed how library users found and utilized information. Where once there was a physical door to the library, there is now an electronic door in the form of the library&#x2019;s web site, the gateway to library subscription databases and other discovery tools. Yet as the Web (and information technology in general) has matured, finding and retrieving information has become more fragmented. Users previously had two choices not so long ago&#x2014;get it from the shelf or find it electronically via a database. There are now infinitely more options for finding information, including globally available search tools like Google Scholar and academic communities, such as Academia.edu and ResearchGate.</p>
<p>There are also more options for storing and sharing information. Users may adopt citation management software to store, annotate, and cite works, including Mendeley, Zotero, or Endnote. Broadly speaking, citation management software has the capacity to change how users manage their scholarly information collections. Citation management software provides a central place for several activities in the scholarly workflow&#x2014;storing, annotating, citing, and sharing. At present, discovery of new materials is available within these tools, but user adoption of discovery options is not high, and comes with a learning curve for users accustomed to searching the web for information. Citation management software is beginning to be seen as part of a portfolio of research management tools (see Elsevier&#x2019;s acquisition of Mendeley, and Digital Science&#x2019;s development of ReadCube and acquisition of Papers&#x2014;both information management tools). Citation management software may be more connected with other tools central to the research workflow, including lab notebook, data management, and research collaboration software (<xref ref-type="bibr" rid="r21">Schonfeld, 2018</xref>). This could raise the visibility and adoption rates of these tools in the future. </p>
<p>This article explores the concept of embedding discovery within citation management software with an eye towards other software environments in the research workflow that might also incorporate discovery. First, we review efforts to place information retrieval within citation management software, and explore related literature on situating discovery more intuitively for researchers. We look at the most frequently adopted citation management software in current use. Building upon this prior research, we describe the results of interviews with Penn State faculty members, focused on their research workflows. This research follows the findings from a qualitative, ethnographic study funded by the Andrew W. Mellon Foundation, analyzing how users at Penn State University, University Park, find, store, annotate, cite, and share information resources (<xref ref-type="bibr" rid="r1">Antonijevi&#x0107; &#x0026; Cahoy, 2014</xref>). The findings from this study indicated that two pervasive areas of disconnect exist within the scholarly workflow: discovering and saving new materials, and archiving self-authored work. We share our recent results, which confirm that faculty users welcome access to library resources within software significant to their individual research workflow. Our interviews also found a desire for more automated services (anticipatory completion of citations, analysis of existing bibliographies), as well as an enthusiasm for commercial services such as Google Scholar and Academia.edu. </p>
<p>As discovery shifts away from web-based platforms and fully situates within software and other platforms, libraries must begin locating discovery services within these tools as well. This choice will help users more intuitively find relevant information sources while connecting those resources to other critical phases in the scholarly workflow, thereby maximizing productivity and minimizing information loss.</p>
</sec>
<sec id="s2">
<title>2. Literature on Discovery&#x2014;on the Library Web Site and Elsewhere</title>
<p>Ithaka S&#x0026;R has been a major source for research on faculty members&#x2019; research behavior and discovery practices. The most recent report, the <italic>Ithaka S&#x0026;R Faculty Survey 2015</italic> looks again at where researchers begin their research&#x2014;a question that has been explored in this series since 2003 (<xref ref-type="bibr" rid="r25">Wolff-Eisenberg, Rod, &#x0026; Schonfeld, 2016</xref>). In previous years, faculty were more likely to use a discipline-specific, electronic subscription database (such as Web of Science, for example) than they were to use a broader search tool (such as Google or Google Scholar). The 2015 report notes a shift towards the broader search tools, with faculty equally as likely to search a search engine or a subscription database to serve their research needs. However, the report also discusses the rise in use of the library web site as a discovery portal (a trend which has been on the rise since 2012, perhaps coinciding with the library web development trend towards creating a front page that functions primarily as a search interface (<xref ref-type="bibr" rid="r20">Schonfeld, 2015</xref>, p. 12).</p>
<p>In &#x201C;Meeting researchers where they start: Streamlining access to scholarly resources&#x201D;, Roger Schonfeld outlines the six areas of failure relative to discovery and libraries, including the difficulty of off campus access and the shrinking profile of the library web site as a singular search destination. He notes that <italic>&#x201C;Mechanisms for content access succeed only when they conform to Lorcan Dempsey&#x2019;s observation that &#x2018;discovery happens elsewhere.&#x2019; Authentication and authorization to licensed e-resources must work effectively without regard to the researcher&#x2019;s starting point&#x201D;</italic> (<xref ref-type="bibr" rid="r20">Schonfeld, 2015</xref>, p. 3). Further, Schonfeld mandates that, <italic>&#x201C;To understand researcher practices, user experience specialists both in a library and a content provider setting should examine the researchers&#x2019; actual practices. Rather than trying to focus on specific tasks related to the system that their current project covers, as is all too often the approach taken, a more holistic, ethnographic perspective is vital&#x201D;</italic> (<xref ref-type="bibr" rid="r20">Schonfeld, 2015</xref>, p. 13). </p>
<p>In &#x201C;Thirteen ways of looking at libraries, discovery, and the catalog: Scale, workflow, attention&#x201D;, <xref ref-type="bibr" rid="r5">Dempsey (2012)</xref> discusses the movement of user search behaviors from the local library catalog to the &#x201C;network scale&#x201D;&#x2014;for example, searching Worldcat.org, a consolidated catalog, for a locally owned book. In this environment, he notes that &#x201C;syndication and leveraging strategies&#x201D; are needed, including connecting to networked resources, such as link resolver recognition within Mendeley or Google Scholar (<xref ref-type="bibr" rid="r5">Dempsey, 2012</xref>, p. 2). In this respect, the local catalog remains the data source, but the user accesses the data via a more globally available resource. Dempsey states, &#x201C;The use and mobilization of bibliographic data and services outside the library catalog is an increasingly important part of library activity. This is especially important as &#x2018;discovery increasingly happens elsewhere&#x2019;&#x2014;in other environments than in the library&#x201D; (<xref ref-type="bibr" rid="r6">Dempsey, 2016</xref>, p. 12).</p>
<p>&#x201C;Thinking the unthinkable: a library without a catalogue&#x2014;Reconsidering the future of discovery tools for Utrecht University Library&#x201D; describes the challenges facing libraries with regard to search and the significance of a destination web presence (<xref ref-type="bibr" rid="r12">Kortekaas, 2012</xref>). Citing user study trends, the Utrecht University Library decided to implement several approaches with regard to discovery: the decision not to implement a large discovery search tool; a commitment to embed library collection-related metadata to global initiatives and re-envisioning the local online public catalog as a tool primarily for known item searches.</p>
<p>Grant identified the concept of discovery within context specific software as a &#x2018;knowledge creation platform&#x2019; (<xref ref-type="bibr" rid="r10">Grant, 2013</xref>, pp. 67&#x2013;73). He highlighted the components of this platform as discovery, social networking, ready access to library expertise and services, and, perhaps most significant, &#x201C;<italic>integrated tools for creating new knowledge.&#x201D;</italic> Grant further specified that the knowledge creation tools <italic>&#x201C;should cleanly integrate within the interface of the KCP so that again, the end-user does not need to step out of the interface in order to actively work on their research or assignment&#x201D;</italic> (<xref ref-type="bibr" rid="r10">Grant, 2013</xref>, p. 71).</p>
</sec>
<sec id="s3">
<title>3. Discovery and Reference Management Software</title>
<p>Recent research articles on citation management software (from within a library-focused lens) have focused primarily on the library&#x2019;s provision of support for citation management software users (<xref ref-type="bibr" rid="r2">Childress, 2011</xref>; <xref ref-type="bibr" rid="r9">Francese, 2013</xref>; <xref ref-type="bibr" rid="r19">Rempel &#x0026; Mellinger, 2015</xref>). Less common are articles looking at the software functionality itself, and opportunities for embedding library services, including discovery options. <xref ref-type="bibr" rid="r13">Lubke, Paulus, Britt, and Atkins (2017)</xref> look at options for unifying the workflow steps connected with the literature review process, but do not address discovery, instead beginning the workflow with the act of PDF storage and annotation.</p>
<p><italic>&#x201C;As the process of citation management changed, the social dimension became more important, as citations are &#x2018;social objects&#x2019; around which connections can be made. Today, citation management programs not only provide a repository in which you can store your work, but also allow you to share your work and to search the work of others&#x201D;</italic> (<xref ref-type="bibr" rid="r7">Dempsey &#x0026; Walter, 2014</xref>). <xref ref-type="bibr" rid="r6">Dempsey (2016)</xref> notes the significance of citation management software for academic libraries beyond simply supporting use of these products: <italic>&#x201C;As some of these researcher-facing &#x2018;productivity&#x2019; services are repackaged as licensed institutional offers, libraries will face important decisions about sourcing and procurement of workflow support services.&#x201D;</italic></p>
</sec>
<sec id="s4">
<title>4. Discovery Within Citation Management&#x2014;An Overview of Current Software</title>
<p>Citation management software became readily available in the 1990s, and has experienced increased user adoption since that time. Originally envisioned primarily as a tool for citing sources and building bibliographies, citation management software has recently begun to increase its scope. Scholarly publishers, such as Elsevier and Nature/Digital Science, see the utility of embedded discovery within citation management software, and have developed discovery options within the native interface. In order to clearly illustrate the current state of discovery within citation management software, we present an overview of the major providers in this area, and the discovery and other workflow innovations included within individual software.</p>
<p><italic>The criteria for inclusion in this overview included the following factors:</italic> </p>
<list list-type="bullet">
<list-item>
<p>The citation management software is an established tool with adoption in academia.</p>
</list-item>
<list-item>
<p>The citation management software allows for storage of citation data and pdfs.</p>
</list-item>
<list-item>
<p>The citation management software works collaboratively with Microsoft Word and other word processing software to produce formatted citations in a style of choice.</p>
</list-item>
</list>
<p><italic>Optional criteria included:</italic></p>
<list list-type="bullet">
<list-item>
<p>The citation management software allows for pdf annotation and note taking within the software.</p>
</list-item>
<list-item>
<p>The citation management software includes a search or other discovery function within the software to search for and identify other relevant scholarly works.</p>
</list-item>
<list-item>
<p>The citation management software includes &#x2018;smart&#x2019; analysis of the user&#x2019;s library and suggestion of other possible relevant works of interest.</p>
</list-item>
<list-item>
<p>The citation management software allows for authentication to the user&#x2019;s academic institution, including library resource access.</p>
</list-item>
</list>
<p>Web-based citation generators, such as EasyBib or Citation Machine, did not meet the required criteria and were therefore not included in this review. </p>
<sec id="s4a">
<title>4.1. Endnote</title>
<p>One of the oldest reference managers, Endnote, has been in existence since the 1990s, and remains the most traditional software program of its kind. Endnote is produced by Clarivarate Analytics, and as such, is embedded within another company product, Web of Science (<xref ref-type="bibr" rid="r3">Clarivate Analytics, 2016</xref>). In Web of Science, Endnote users can save citations to Endnote Web or export citations out to the desktop Endnote. Endnote&#x2019;s discovery options within its web and desktop interfaces are more limited than similar software, such as Mendeley or ReadCube. A &#x2018;Find Full Text&#x2019; option allows users to match citations with affiliated PDFs, through an automated search of large repositories. &#x2018;Capture&#x2019;, a recently released Endnote bookmarklet tool, allows capture of citations from within a web browser. Users can also search for and retrieve citations through a Z39.50 search tool that includes many library catalogs. This search tool only retrieves citations, and while being one of the first examples of discovery within citation tools, it remains rudimentary in its approach and in its search results. Unlike Mendeley, analysis of a user&#x2019;s reference library (and resulting suggested relevant articles) is not available.</p>
</sec>
<sec id="s4b">
<title>4.2. Mendeley</title>
<p>Mendeley, citation and reference management software for the Web, desktop, and mobile device, was founded in 2007 (<xref ref-type="bibr" rid="r22">Shema, 2012</xref>). Initially, Mendeley&#x2019;s most significant focus was bibliographic citation management, including integration with Microsoft Word and other word processing software. In recent years, Mendeley has begun to expand its reach as a platform for collaboration and information discovery. Mendeley&#x2019;s catalog of user-uploaded citations and papers is an expansive data collection unto itself. While lacking the standardization of a traditional catalog, Mendeley users can search the Mendeley catalog from within the application, saving citations and PDFs to their reference library. On the Mendeley web site, users can also personalize their account to add finding full text options linking to their home institution resources. A newer feature, Mendeley Suggest, analyzes an individual user&#x2019;s library and provides relevance ranked articles based on works in their existing library. In 2013, the publishing conglomerate Elsevier bought Mendeley. The acquisition has (to date) not changed how users access articles from within Mendeley. It has broadened how users can access Mendeley, including the embedding of Mendeley services within Elsevier-provided databases (such as ScienceDirect).</p>
</sec>
<sec id="s4c">
<title>4.3. Zotero</title>
<p>Like Mendeley, Zotero is a more recent tool (released in 2006) and has a web and standalone version available to users (<xref ref-type="bibr" rid="r26">Zotero, 2008</xref>). The original version of Zotero existed entirely within the Web browser (as a plugin for Firefox/Chrome/Safari/IE). A later version extended a desktop version, Zotero Standalone. Until 2016, options for discovery were not available within the Zotero interface, perhaps because Zotero was situated directly within the discovery portal itself (the Web browser). A new optimization created in 2016 (and funded by a Mellon Foundation grant awarded to the author and Zotero) allows for user created feeds within the Zotero interface. These feeds can be for a search, a journal title (and affiliated new articles) or other relevant syndicated content. The user views the feeds within the Zotero interface, and selects specific items from the feed to add to their Zotero library. There is a complexity to this feature, in that a user must know how to create and put to use an RSS feed. As this is a very new service (still in beta testing) within Zotero, its utility and level of usage remain to be seen.</p>
</sec>
<sec id="s4d">
<title>4.4. ReadCube</title>
<p>ReadCube, software developed in 2011 by Digital Science &#x0026; Research Solutions, is a reference management program with search at the center of the interface. New users are encouraged to deposit their PDF collections within ReadCube, which then processes the articles by DOI and other metadata, and (in the paid version) allows users to automatically connect with and retrieve articles citing and cited by articles in the user&#x2019;s collections. In other words, ReadCube places discovery of new materials entirely within the user interface. ReadCube also has agreements with several large publishers, including Nature Publishing Group, Frontiers, and Wiley Publishing to feature their journal articles (including the option to purchase access to articles) within their interface. The user has the option to search several different catalogs from within the ReadCube interface, including Google Scholar, PubMed, and the ReadCube catalog. The option to authenticate with an institution specific proxy to aid in full text discovery is also available. ReadCube also recently acquired Papers<sup>3</sup>, another reference management tool featuring embedded search options (<xref ref-type="bibr" rid="r8">Digital Science &#x0026; Research Solutions, 2016</xref>). As Mac only software, Papers<sup>3</sup> features embedded search similar to ReadCube, but also offers enhanced citation management capabilities. Digital Science &#x0026; Research Solutions&#x2019; acquisition of Papers<sup>3</sup> may be with an eye towards bringing the strengths of Papers<sup>3</sup> and ReadCube together within one tool in the future.</p>
</sec>
<sec id="s4e">
<title>4.5. Other Citation Managers</title>
<p>A wide range of reference management systems exist for users with specific needs. ProQuest owned Refworks is an older tool, created in 2001 and entirely Web-based. Marketed heavily to libraries, access to Refworks can be embedded within library databases. Refworks also takes institutional affiliation and employs it within the interface as an institution specific link to full text articles (<xref ref-type="bibr" rid="r18">Refworks, 2016</xref>). Sente reference management software is Mac only, and features an embedded browser directly within the interface. The user does not need to leave the Sente environment to search for and retrieve resources. Sente&#x2019;s &#x2018;targeted browsing&#x2019; features allow users to search for information on supported websites, seeing automatically which articles are already in their library. Within Sente, options also include the ability to automate regular searches of selected databases (PubMed, Web of Science, Z39.50 tools) (<xref ref-type="bibr" rid="r24">Third Street Software, 2016</xref>). While discovery is more separate in Sente, it is integrated within the software in a manner that allows users to find, store, annotate, and cite from one tool. Perhaps because they are Mac only, Sente and Papers<sup>3</sup> have not experienced the adoption levels of Mendeley, Zotero, and Endnote. Other tools with smaller user bases (not studied in this article) include BibDesk, JabRef, Citavi, CiteULike, and Connotea.</p>
<p>Integration of library discovery services within citation management software remains limited. Of the tools detailed in this article, there are few options to link directly within a reference management interface to library databases or other services. ReadCube provides perhaps the best access, asking users explicitly for their institutional affiliation and authentication information. ReadCube also prompts the user to login for authentication as the article retrieval process automatically begins within the interface. Endnote provides the opportunity to enter an authentication URL and an SFX resolver in preferences to aid in connecting with article PDFs via &#x2018;Find Full Text&#x2019;. Endnote Z39.50 search of library catalogs and other resources is rudimentary and perhaps one of the first examples of a search tool within citation software. Endnote also supports searching selected subscription databases (although this feature typically only works via a private subscription rather than an institutional one). Zotero features RSS feeds for discovery, which could be generated from library subscription databases. Mendeley previously offered the option (on their web interface) to enter an authentication URL and connect through the library to subscription articles. In 2016, this option was changed to only provide a DOI search to a journal provider (<xref ref-type="bibr" rid="r11">Gunn, 2016</xref>). Mendeley also features a newer service, Mendeley Suggest, which analyzes the user&#x2019;s library and offers recommended citations based on the user&#x2019;s library data. ReadCube offers a similar service as well, and their enhanced PDF optimization makes references within an individual article clickable, simplifying retrieval of related works. If anything, the current development trajectory for reference management software indicates that emphasis on the journal provider, rather than the user&#x2019;s academic institution, will remain at the forefront in the near future. Work and advocacy is needed from academic libraries to ensure greater recognition of the important role of library services and institutional authentication in reference management software use.</p>
</sec>
</sec>
<sec id="s5">
<title>5. Methodology and Prior Study Results</title>
<p>In 2012, the author received a grant from the Andrew W. Mellon Foundation to conduct research on faculty management of information within the scholarly workflow, including discovery and self-archiving of significant works (<xref ref-type="bibr" rid="r16">Penn State University, 2016</xref>). This grant was followed up in 2014 by another Mellon Foundation grant, enabling further research on the scholarly workflow, including software development by George Mason University&#x2019;s citation management software, Zotero, to embed new options for workflow management within the Zotero environment (<xref ref-type="bibr" rid="r15">Penn State University, 2014</xref>). </p>
<sec id="s5a">
<title>5.1. Study Results: 2012 Study on Faculty Scholarly Workflow</title>
<p>The results of the 2012 study on faculty scholarly workflow were shared in the article, &#x201C;Personal Library Curation: An Ethnographic Study of Scholars&#x2019; Information Practices&#x201D; (<xref ref-type="bibr" rid="r1">Antonijevi&#x0107; &#x0026; Cahoy, 2014</xref>). The article presents the results of a web-based survey of scholars (n&#x003D;196), as well as the analysis of ethnographic interviews with 23 Penn State faculty members during the same time period. The survey and interviews indicated, across faculty, a preference for electronic searches for information sources, more often using commercial sources (such as Google or Google Scholar) than more local, library-based resources (although Humanities researchers were more likely to start with library databases). Faculty also relied heavily on their own personal collections of article PDFs and data. With regard to citation management software, the 2012 study found limited use of the software, slightly more than 50&#x0025; of surveyed Penn State faculty in the Sciences, and 30&#x0025; in the Humanities. Faculty noted dissatisfaction with citation management software as a reason for non-adoption. When queried, survey respondents indicated as a majority that the responsibility for education on workflow management resided with the scholar, and not the library or campus librarians. With the results of the survey and interviews combined, the study found overall that faculty experience a pervasive disconnect between the activities of finding information (typically in a web-based, commercial service) and annotating and citing the information (within Microsoft Word or citation management software). Similarly, the act of archiving was also disconnected from the research process, with a majority of respondents indicating that they had lost important files or data. With this portrait of a disconnect existing within the scholarly workflow, particularly within the areas of discovery and self-archiving, the 2014 study was created to begin to explore and address this need.</p>
</sec>
<sec id="s5b">
<title>5.2. 2014 Study on Software Optimizations and Impact on the Scholarly Workflow</title>
<p>In the second stage of our study, we conducted usability testing and interviews, focused on new enhancements in the areas of discovery and archiving added to Zotero, citation management software. The author worked with Zotero software developers to embed new functionality within Zotero, based on the findings re: the disconnectedness of the faculty scholarly workflow within the first phase of this study. Two specific enhancements were added to Zotero (and as of 2016 are publicly available to all users) as a result of the initial study (<xref ref-type="bibr" rid="r23">Takats, 2014</xref>). The first, addition of RSS feeds, addressed issues with the disconnected nature of discovery in relation to citation management software. The capability to add RSS feeds of any kind (including those pointing to journal level table of contents or targeted article or database keyword searches) was embedded within the Zotero interface. The user identified the RSS feed (from outside Zotero) and then entered the feed URL into the Zotero interface. Once the feed was accepted, it would begin retrieving results, which the user could browse within the Zotero interface, and selectively decide whether to add individual results to the user&#x2019;s Zotero library. The second optimization focused on self-archiving of authored works. Zotero created a new &#x2018;My Publications&#x2019; folder (the feature is also seen in other citation managers, such as Mendeley and Endnote). The &#x2018;My Publications&#x2019; folder is intended to be populated with the user&#x2019;s authored works. Once in the folder, the user may decide on access levels for publications, and may feature the publication(s) on their public Zotero profile. An additional enhancement created as a result of this study connected Penn State&#x2019;s institutional Hydra-based repository, ScholarSphere, with the &#x2018;My Publications&#x2019; folder, so that Penn State users could easily self-archive both on the Zotero server and within the Penn State IR. The code to connect a Hydra repository and Zotero is now publicly available for other institutions use (<xref ref-type="bibr" rid="r17">projecthydra, 2016</xref>).</p>
</sec>
<sec id="s5c">
<title>5.3. Post-usability Interview Findings</title>
<p>A total of eight (8) subjects, four graduate students and four faculty members, participated in usability testing of the Zotero enhancements and a follow-up discussion of the utility of the new features. The small sample size for the usability testing was taken from <xref ref-type="bibr" rid="r14">Nielsen and Landauer&#x2019;s recommendation (1993)</xref> that found the highest ratio of benefit to cost was found in a usability testing pool of between six and eight evaluators. The participants were divided between the social sciences and the sciences, with one Humanities graduate student. Audio recordings and transcripts were created for the sessions as well. The post-usability testing interviews lasted approximately 30-45 minutes in length. After usability testing concluded, each interviewee was asked a series of questions related to Zotero software and their own scholarly workflow needs (Appendix A). A majority of the participants had also participated in the 2014 study, and used citation management software regularly in their work. Although broad conclusions perhaps cannot be drawn from the interview findings, this pool of expert and enthusiastic citation management software users&#x2019; feedback and ideas provide food for thought on the future of these tools. While the initial Zotero optimizations proved challenging for usability testing, the post-usability testing interviews brought forward unique findings on what users expect and hope for with regard to workflow tools. </p>
<p>In the interviews, a majority of subjects referenced the disconnectedness of their scholarly workflow, and indicated the need for increased discovery options within the citation management software interface. General trends that emerged across the interviews addressed &#x2018;smarter&#x2019; functionality within software (citation management software and word processing software), the value of commercial, broadly available scholarly services (such as Google Scholar, Academia.edu, and Research Gate), and a perceived lack of value for local storage and social networking services, including the institutional repository. While there are detailed findings in all of these areas, we will focus in this article on the findings related primarily to discovery and within the workflow as a whole.</p>
</sec>
<sec id="s5d">
<title>5.4. Automating and Connecting the Scholarly Workflow</title>
<p>In general, greater automation of the scholarly research process was desired by multiple interview subjects. Several subjects mentioned the ability for citation management software to automatically &#x2018;complete&#x2019; incomplete citations (according to individual citation style needs) without intervention or direction from the user. Another subject mentioned &#x2018;anticipatory&#x2019; automation, where (within Microsoft Word) the citation management software would automatically complete a citation based on the references discussed in a specific paragraph. A tenured faculty member shared a &#x2018;wish list&#x2019; of optimizations that he termed as &#x201C;customizable automation&#x201D;: these included natural language searching, validation of localized services from within the tool, and notifications when new citations are found so that users can validate entries.</p>
</sec>
<sec id="s5e">
<title>5.5. Discovery Feeds</title>
<p>Four of the subjects interviewed in the study indicated that they already receive new content alerts (for new journal articles, etc&#x2026;) in their email accounts. All of the interview subjects were positive about the new discovery RSS feeds within Zotero. The graduate students preferred to have email alerts continue, in addition to receiving new citation feeds within Zotero. One graduate student noted that this dual notification would be a good reminder to go back into Zotero and engage with new sources. A tenured faculty member said of the utility of the feeds, &#x201C;I think the main role is obviously discovering a new work. Since I do some variations on that, I would probably use it. It&#x2019;s interesting now that I really think about it. I used to use RSS feeds all the time.&#x201D; This faculty member also noted that &#x201C;It&#x2019;s much more random now than it used to be. It&#x2019;s much more pull rather push.&#x201D; New research (for whatever reason) had stopped flowing naturally to his workflow, and he welcomed an option that might change this. Another faculty member stated that he saw the feeds as a positive enhancement, yet would not use them within Zotero. He preferred to have his alerts continue to arrive in his email, where he could search his entire email collection to find and retrieve specific items. </p>
</sec>
<sec id="s5f">
<title>5.6. Embedded Discovery Services</title>
<p>All of the usability subjects were enthusiastic about multiple services (including library authentication and content) within the citation management software interface. In one participant&#x2019;s words, it would give her the ability to &#x201C;multitask on one screen.&#x201D; Another was positive about this, and stipulated the institution-specific authentication must also be integrated into the interface in order for this enhancement to be useful. A faculty member expressed a desire for natural language searching within the citation software, as well as the ability to automatically receive new relevant citations within the interface, with an alert for the user to validate and accept citations. Another faculty member had a more intricate idea for embedded services&#x2014;analysis of existing bibliographies, combined with embedded discovery and authentication to bring relevant new works automatically to the user. In essence, this idea is that the citation management software looks at publications the user has written. It extracts the data from the publication bibliography, and retrieves cited works that are not currently in the user&#x2019;s library, for the user to accept. It also does analysis on relevant works (based on the cited works in the bibliography) and asks the user to accept those citations as well. </p>
</sec>
</sec>
<sec id="s6">
<title>6. Discussion and Conclusion</title>
<p>Our post-usability testing interview participants were uniformly focused on using commercial search and software tools (Mendeley, Zotero, ResearchGate, Google Scholar), and saw the benefits of accessing and utilizing resources on platforms that are not primarily locally developed. They were open to embedded discovery services, and utilizing these services within citation management software. It seemed that perhaps the biggest user barrier was in learning how to use and integrate citation management software into one&#x2019;s workflow. Once that was achieved (as was the case with a majority of our participants, with most of them admittedly expert users), the idea of adding on additional services seemed natural and realistic. </p>
<p>What do these findings mean for citation management software designers? For the Zotero designers, there are several clear outcomes to share. Our interview subjects were enthusiastic about discovery within the citation software interface. They expressed a desire for better automation of tasks, across the board. Software providers should consider the primacy of email as an information collection (and as an alert/reminder service) for users, offering the option of emailing users when new citations are found by the feed. Users valued their email collections, and looked to their email as a reminder to return to workflows in more disconnected applications, such as Zotero.</p>
<p>Software providers should also consider ways to mine user&#x2019;s existing citation collections for additional recommended citations. As we previously mentioned, other citation managers have begun this service, including Mendeley Suggest and ReadCube Recommendations<italic>.</italic> It makes simple sense to mine the data that the researcher has already deposited to increase the utility of the tool within the scholarly workflow. The idea suggested by one of our subjects to pay special attention to importing works already cited in the researcher&#x2019;s publications would again likely be a huge value-added service for users. Customization, automation, and predictive (i.e. smart) services are what the users in our study clearly wanted.</p>
<p>There are also recommendations for integration within word processing software. Our participants were as a group, clear that they wanted better integration with Word processing software, including infusing discovery into the word processing environment. How could this work? It might mean embedding semantic web capabilities within word processing software, for uses such as predicting / completing citations for the user, finding works attached to quotes used in text, etc&#x2026; This is a new area of development for citation management software and research workflow software in general, and one that should be taken seriously.</p>
<p>Like the conclusions leading away from local tool provision in Kortekaas &#x2018;<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7910/DVN/AWDACG">Thinking the Unthinkable</ext-link>&#x2019; (2012), these findings are a clarion call for academic libraries&#x2019; discovery, storage, and instructional strategies. The significant &#x2018;critical mass&#x2019; of other researchers that one of our subjects referenced is not present on local tools, such as the online catalog or the institutional repository. The general focus of the post-usability testing interviews was to determine the utility of discovery and archiving within the Zotero interface. The findings were unanimous among our subjects that localized content and services are welcomed within citation management software. The challenge now is for software providers, publishers, and academic libraries to begin embedding content where our users are rather than where we want them to go (library web sites, publisher web sites, subscription databases). This requires cross-institutional work on the part of academic libraries; large developments like this can&#x2019;t occur on a campus by campus basis. It also means that academic libraries must begin to give up local development of services that are not heavily or intuitively used by their core user groups. From an ego or vanity perspective, this will be difficult for academic libraries. Yet, if it means that the resources a user needs are directly (to borrow a phrase from Lorcan Dempsey) &#x201C;in the flow&#x201D; when and where they need it, haven&#x2019;t the library&#x2019;s goals as a content provider been met? (<xref ref-type="bibr" rid="r4">Dempsey, 2005</xref>) With a continued focus on how users find information <italic>outside</italic> the library web site, and beyond that, outside the web browser, software providers and libraries can begin to close the gap and bring resources to users more easily from directly within their research workflow.</p>
</sec>
</body>
<back>
<ack>
<p>We thank the Andrew W. Mellon Foundation for their generous support of the research described in this article. Thank you to Smiljana Antonijevi&#x0107; Ubois, Jeff Edmunds, Eric Novotny, and John Meier for their valuable feedback and insights on the research studies described in this article.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="r1"><mixed-citation>Antonijevi&#x0107;, S., &#x0026; Cahoy, E.S. (2014). Personal library curation: An ethnographic study of scholars&#x2019; information practices. <italic>Portal: Libraries and the Academy</italic>, <italic>14</italic>(2), 287&#x2013;306. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1353/pla.2014.0010">https://doi.org/10.1353/pla.2014.0010</ext-link>.</mixed-citation></ref>
<ref id="r2"><mixed-citation>Childress, D. (2011). Citation tools in academic libraries: Best practices for reference and instruction. <italic>Reference and User Services Quarterly</italic>, <italic>51</italic>(2), 143&#x2013;152. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5860/rusq.51n2.143">https://doi.org/10.5860/rusq.51n2.143</ext-link></mixed-citation></ref>
<ref id="r3"><mixed-citation>Clarivate Analytics. (2016). Acquisition of the Thomson Reuters Intellectual Property and Science business by Onex and Baring Asia Completed. Retrieved August 7, 2017, from <ext-link ext-link-type="uri" xlink:href="http://www.prnewswire.com/news-releases/acquisition-of-the-thomson-reuters-intellectual-property-and-science-business-by-onex-and-baring-asia-completed-300337402.html">http://www.prnewswire.com/news-releases/acquisition-of-the-thomson-reuters-intellectual-property-and-science-business-by-onex-and-baring-asia-completed-300337402.html</ext-link>.</mixed-citation></ref>
<ref id="r4"><mixed-citation>Dempsey, L. (2005). <italic>In the flow</italic>. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://orweblog.oclc.org/in-the-flow/">http://orweblog.oclc.org/in-the-flow/</ext-link>.</mixed-citation></ref>
<ref id="r5"><mixed-citation>Dempsey, L. (2012). Thirteen ways of looking at libraries, discovery, and the catalog: Scale, workflow, attention. <italic>EDUCAUSE Review Online</italic>, (December), 1&#x2013;14. Retrieved from <ext-link ext-link-type="uri" xlink:href="http://er.educause.edu/articles/2012/12/thirteen-ways-of-looking-at-libraries-discovery-and-the-catalog-scale-workflow-attention">http://er.educause.edu/articles/2012/12/thirteen-ways-of-looking-at-libraries-discovery-and-the-catalog-scale-workflow-attention</ext-link>.</mixed-citation></ref>
<ref id="r6"><mixed-citation>Dempsey, L. (2016). Library collections in the life of the user: two directions. <italic>LIBER Quarterly</italic>, <italic>26</italic>(4), 338&#x2013;359. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18352/lq.10170">https://doi.org/10.18352/lq.10170</ext-link>.</mixed-citation></ref>
<ref id="r7"><mixed-citation>Dempsey, L., &#x0026; Walter, S. (2014). A platform publication for a time of accelerating change. <italic>College &#x0026; Research Libraries</italic>, <italic>75</italic>(6), 760&#x2013;762. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5860/crl.75.6.760">https://doi.org/10.5860/crl.75.6.760</ext-link>.</mixed-citation></ref>
<ref id="r8"><mixed-citation>Digital Science. (2016). <italic>ReadCube acquires Papers from Springer Nature</italic>. Digital Science. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="https://www.digital-science.com/press-releases/readcube-acquires-papers-springer-nature/">https://www.digital-science.com/press-releases/readcube-acquires-papers-springer-nature/</ext-link>.</mixed-citation></ref>
<ref id="r9"><mixed-citation>Francese, E. (2013). Usage of reference management software at the University of Torino. <italic>JLIS.it</italic>, <italic>4</italic>(2), 145&#x2013;174. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4403/jlis.it-8679">https://doi.org/10.4403/jlis.it-8679</ext-link>.</mixed-citation></ref>
<ref id="r10"><mixed-citation>Grant, C. (2013). Knowledge creation platforms: The next step after web-scale discovery. <italic>027.7 Zeitschrift F&#x00FC;r Bibliothekskultur/Journal of Library Culture</italic>, <italic>1</italic>(2), 67&#x2013;73. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.12685/027.7-1-2-32">https://doi.org/10.12685/027.7-1-2-32</ext-link>.</mixed-citation></ref>
<ref id="r11"><mixed-citation>Gunn, W. (2016). Twitter post. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="https://twitter.com/mrgunn/status/771722800797130752">https://twitter.com/mrgunn/status/771722800797130752</ext-link>.</mixed-citation></ref>
<ref id="r12"><mixed-citation>Kortekaas, S. (2012). <italic>Thinking the unthinkable: a library without a catalogue &#x2013; Reconsidering the future of discovery tools for Utrecht University Library</italic>. LIBER. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://www.libereurope.eu/blog/thinking-the-unthinkable-a-library-without-a-catalogue-reconsidering-the-future-of-discovery-to">http://www.libereurope.eu/blog/thinking-the-unthinkable-a-library-without-a-catalogue-reconsidering-the-future-of-discovery-to</ext-link>.</mixed-citation></ref>
<ref id="r13"><mixed-citation>Lubke, J., Paulus, T.M., Britt, V.G., &#x0026; Atkins, D.P. (2017). Hacking the literature review: Opportunities and innovations to improve the research process. <italic>Reference &#x0026; User Services Quarterly</italic>, <italic>56</italic>(4), 285&#x2013;295. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5860/rusq.56.4.285">https://doi.org/10.5860/rusq.56.4.285</ext-link></mixed-citation></ref>
<ref id="r14"><mixed-citation>Nielsen, J., &#x0026; Landauer, T.K. (1993). A mathematical model of the finding of usability problems. In <italic>Proceedings of the INTERACT&#x2019;93 and CHI&#x2019;93 Conference on Human Factors in Computing Systems</italic> (pp. 206&#x2013;213). New York: ACM Digital Library.</mixed-citation></ref>
<ref id="r15"><mixed-citation>Penn State University. (2014). <italic>Penn State awarded Mellon grant to study personal scholarly archiving</italic>. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://news.psu.edu/story/312098/2014/04/15/academics/penn-state-awarded-mellon-grant-study-personal-scholarly-archiving">http://news.psu.edu/story/312098/2014/04/15/academics/penn-state-awarded-mellon-grant-study-personal-scholarly-archiving</ext-link>.</mixed-citation></ref>
<ref id="r16"><mixed-citation>Penn State University. (2016). <italic>Penn State librarian leads exploration of scholarly research activities.</italic> Penn State University. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://news.psu.edu/story/418547/2016/07/25/academics/penn-state-librarian-leads-exploration-scholarly-research">http://news.psu.edu/story/418547/2016/07/25/academics/penn-state-librarian-leads-exploration-scholarly-research</ext-link>.</mixed-citation></ref>
<ref id="r17"><mixed-citation>projecthydra. (2016). <italic>Sufia - Zotero Integration</italic>. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="https://github.com/projecthydra/sufia/blob/master/README.md&#x0023;zotero-integration">https://github.com/projecthydra/sufia/blob/master/README.md&#x0023;zotero-integration</ext-link>.</mixed-citation></ref>
<ref id="r18"><mixed-citation>Refworks. (2016). <italic>About us</italic>. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="https://www.refworks.com/content/about_us.asp">https://www.refworks.com/content/about_us.asp</ext-link>.</mixed-citation></ref>
<ref id="r19"><mixed-citation>Rempel, H.G., &#x0026; Mellinger, M. (2015). Bibliographic management tool adoption and use &#x2013; A qualitative research study using the UTAUT model. <italic>Reference and User Services Quarterly</italic>, <italic>54</italic>(4), 43&#x2013;53. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5860/rusq.54n4.43">https://doi.org/10.5860/rusq.54n4.43</ext-link>.</mixed-citation></ref>
<ref id="r20"><mixed-citation>Schonfeld, R.C. (2015). <italic>Meeting researchers where they start: Streamlining access to scholarly resources</italic>. Ithaka S&#x002B;R. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18665/sr.241038">https://doi.org/10.18665/sr.241038</ext-link>.</mixed-citation></ref>
<ref id="r21"><mixed-citation>Schonfeld, R.C. (2018). <italic>Big deal: Should universities outsource more core research infrastructure?</italic> Ithaka S&#x002B;R. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18665/sr.306032">https://doi.org/10.18665/sr.306032</ext-link>.</mixed-citation></ref>
<ref id="r22"><mixed-citation>Shema, H. (2012). <italic>Interview with Dr. Victor Henning, Mendeley.</italic> Scientific American Blog Network. Retrieved August 7, 2017, from <ext-link ext-link-type="uri" xlink:href="https://blogs.scientificamerican.com/information-culture/interview-with-dr-victor-henning-mendeley/">https://blogs.scientificamerican.com/information-culture/interview-with-dr-victor-henning-mendeley/</ext-link>.</mixed-citation></ref>
<ref id="r23"><mixed-citation>Takats, S. (2014). <italic>Feeds and institutional repositories coming to Zotero</italic>. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="https://www.zotero.org/blog/feeds-and-institutional-repositories-coming-to-zotero/">https://www.zotero.org/blog/feeds-and-institutional-repositories-coming-to-zotero/</ext-link>.</mixed-citation></ref>
<ref id="r24"><mixed-citation>Third Street Software. (2016). <italic>Sente 6 - Academic Reference Manager for Mac, iPad and iPhone.</italic> Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://www.thirdstreetsoftware.com/site/Sente.html">http://www.thirdstreetsoftware.com/site/Sente.html</ext-link>.</mixed-citation></ref>
<ref id="r25"><mixed-citation>Wolff-Eisenberg, C., Rod, A.B., &#x0026; Schonfeld, R.C. (2016). <italic>Ithaka S&#x002B;R US faculty survey 2015</italic>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18665/sr.277685">https://doi.org/10.18665/sr.277685</ext-link>.</mixed-citation></ref>
<ref id="r26"><mixed-citation>Zotero. (2008). About Zotero. Retrieved August 17, 2017, from <ext-link ext-link-type="uri" xlink:href="http://zotero.pbworks.com/w/page/5511965/About">http://zotero.pbworks.com/w/page/5511965/About</ext-link> Zotero.</mixed-citation></ref>
</ref-list>
<app-group>
<app>
<title>Appendix A: Interview Questions</title>
<p>(An edited transcription of selected interview responses can be found at the Harvard Dataverse: <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7910/DVN/AWDACG">https://doi.org/10.7910/DVN/AWDACG</ext-link>).</p>
<p><list list-type="order">
<list-item><p>What do you see as the main functions of the software you used today, Zotero and Scholarsphere?</p></list-item>
<list-item><p>What was the most difficult or frustrating part of using Zotero? Why? What was the easiest? Why?</p></list-item>
<list-item><p>What was the most difficult or frustrating part of using Scholarsphere? Why? What was the easiest? Why?</p></list-item>
<list-item><p>What do you see as the main function of the My Publications folder in Zotero? How likely would you use it in your work; why or why not?</p></list-item>
<list-item><p>What do you see as the main role of the feed functionality in Zotero? How likely would you use it in your work; why or why not?</p></list-item>
<list-item><p>What citation software (define if needed) if any, do you typically use to facilitate your research? If you don&#x2019;t use citation software, can you describe how you create bibliographic references and save your research materials?</p></list-item>
<list-item><p>If you were to envision an ideal system for finding, citing and saving your research what would it look like?</p></list-item>
<list-item><p>Is the ability to export your publications from a citation tool like Zotero to a repository like Scholarsphere valuable to your work? Why? Is this something you see yourself doing? (offer options, never, infrequently, regularly, often, always)</p></list-item>
<list-item><p>How often do you go back to and review the articles that you are collecting for your coursework / research?</p></list-item>
</list></p>
</app>
</app-group>
</back>
</article>