<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article article-type="research-article" xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">LIBER</journal-id>
<journal-title-group>
<journal-title>LIBER QUARTERLY</journal-title>
</journal-title-group>
<issn pub-type="epub">2213-056X</issn>
<publisher>
<publisher-name>openjournals.nl</publisher-name>
<publisher-loc>The Hague, The Netherlands</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">lq.18689</article-id>
<article-id pub-id-type="doi">10.53377/lq.18689</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Determinants of Downloads as Demand for Hybrid Journals: Rationale for Bundling Services</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-6663-7798</contrib-id>
<name>
<surname>Asai</surname>
<given-names>Sumiko</given-names>
</name>
<email>asai@meiji.ac.jp</email>
<xref ref-type="aff" rid="aff1"/>
</contrib>
<aff id="aff1">Meiji University, Tokyo, Japan</aff>
</contrib-group>
<pub-date pub-type="epub">
<month>05</month>
<year>2024</year>
</pub-date>
<volume>34</volume>
<fpage>1</fpage>
<lpage>11</lpage>
<permissions>
<copyright-statement>Copyright 2024, The copyright of this article remains with the author</copyright-statement>
<copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.liberquarterly.eu/article/10.53377/lq.18689"/>
<abstract>
<p>Although Big Deal contracts that provide access rights to all electronic journals published by the publisher initially gained favor with university libraries, some libraries have terminated these contracts owing to increased charges since the 2010s. Consequently, they are faced with the problem of selecting journals for purchase within their limited budgets. This study investigates the factors affecting the number of downloads, representing journal demand, to provide libraries with guidance on journal selection. The download equation for 1,485 hybrid journals published by Springer Nature is formulated using ordinary least squares. The results found that 5&#x0025; and 50&#x0025; of the 1,485 journals generated approximately 30&#x0025; and 85&#x0025; of the downloads in 2022, respectively. Downloads are concentrated in fewer journals, although the Pareto principle does not apply to hybrid journals. Demand concentration implies that libraries do not need to maintain access rights to all journals. Recently, a few leading publishers have provided access rights to almost all electronic journals based on transformative agreements aiming to promote open access. Therefore, this study&#x2019;s findings raise the issue of the rationale for bundling electronic journals in transformative agreements, which is similar to Big Deal. Moreover, the results of the download estimation reveal that hybrid journals with more open access articles, larger citation scores, and longer histories acquire more downloads. These findings indicate that open access accelerates the dissemination of research.</p>
</abstract>
<kwd-group>
<kwd>download</kwd>
<kwd>journal demand</kwd>
<kwd>Pareto principle</kwd>
<kwd>Big Deal</kwd>
<kwd>transformative agreement</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1. Introduction</title>
<p>In the 2000s, many university libraries signed Big Deal contracts&#x2014;the bundling services of printed and all electronic journals&#x2014;with leading publishers, such as Elsevier and Springer Nature. However, some libraries have terminated their services since the 2010s owing to increased charges (<xref ref-type="bibr" rid="r15">Olsson et al., 2020</xref>; <xref ref-type="bibr" rid="r16">Pedersen et al., 2014</xref>; <xref ref-type="bibr" rid="r18">Rodr&#x00ED;guez-Bravo et al., 2021</xref>). Instead of acquiring access rights to all electronic journals based on Big Deal contracts, they have reverted to the practice of subscribing to individual journals, purchasing individual articles called the pay-per-view model, and relying on interlibrary loans (<xref ref-type="bibr" rid="r16">Pedersen et al., 2014</xref>). Nevertheless, as subscription prices have steadily increased, university libraries face the dilemma of determining which journals to subscribe to in order to fulfill researchers&#x2019; needs within their constrained budgets. Transformative agreements aiming to shift hybrid journals to fully open access journals have recently become pervasive in the academic journal market. A few leading publishers provide research institutions that sign transformative agreements with the bundling service of almost all journals. The services based on the transformative agreements, which provide access to a large number of electronic journals by adding small costs, including article processing charges, have similarities with Big Deal services. Hence, the issue of the rationale for bundling services persists today. <xref ref-type="bibr" rid="r11">Emrani et al. (2010)</xref> and <xref ref-type="bibr" rid="r24">Zhu and Xiang (2016)</xref> showed that the Pareto principle, which suggests that 80&#x0025; of demand originates from 20&#x0025; of supply, applies to the academic journal market. Contrastingly, <xref ref-type="bibr" rid="r13">Loan and Mushtaq (2023)</xref> found that the Pareto principle does not apply to oncology journals, despite the concentration of demand in a small number of journals. We examine the usefulness of bundling services by investigating the distribution of downloads, including the issue of whether the Pareto principle applies to target journals.</p>
<p>As for-profit publishers have not released their circulation data since approximately 2000 (<xref ref-type="bibr" rid="r2">Bergstrom, 2001</xref>), we cannot precisely investigate the subscription demand for individual journals. In contrast, Springer Nature announces the number of article downloads and annual number of journal downloads on its journals&#x2019; websites. <xref ref-type="bibr" rid="r1">Asai (2022)</xref>, <xref ref-type="bibr" rid="r7">Davis (2011)</xref>, and <xref ref-type="bibr" rid="r22">Wang et al. (2015)</xref> found a higher number of downloads of open access articles than of non-open access articles. Furthermore, <xref ref-type="bibr" rid="r14">Moed (2005)</xref> revealed that approximately 40&#x0025; of article downloads occurred within six months from the release. Several other studies have also demonstrated that downloads are concentrated shortly after an article is released (<xref ref-type="bibr" rid="r1">Asai, 2022</xref>; <xref ref-type="bibr" rid="r8">Davis, 2013</xref>; <xref ref-type="bibr" rid="r9">Ding et al., 2021</xref>; <xref ref-type="bibr" rid="r19">Schloegl &#x0026; Gorraiz, 2010</xref>; <xref ref-type="bibr" rid="r23">Watson, 2007</xref>). Moreover, several studies examined the relationship between downloads and citations (<xref ref-type="bibr" rid="r5">Congleton et al., 2022</xref>; <xref ref-type="bibr" rid="r9">Ding et al., 2021</xref>; <xref ref-type="bibr" rid="r12">Fern&#x00E1;ndez-Ramos et al., 2023</xref>; <xref ref-type="bibr" rid="r14">Moed, 2005</xref>; <xref ref-type="bibr" rid="r20">Schl&#x00F6;gl et al., 2014</xref>; <xref ref-type="bibr" rid="r21">Vaughan et al., 2017</xref>). <xref ref-type="bibr" rid="r21">Vaughan et al. (2017)</xref> demonstrated that the correlation coefficients between downloads and citations ranged from 0.57 in medicine to 0.88 in engineering and the humanities using a Chinese database of academic journals. <xref ref-type="bibr" rid="r12">Fern&#x00E1;ndez-Ramos et al. (2023)</xref> found a positive correlation between downloads and citations using <ext-link ext-link-type="uri" xlink:href="https://support.proquest.com/s/article/COUNTER-Journal-Report-1?language=en_US">COUNTER Journal Report 1</ext-link>. <xref ref-type="bibr" rid="r9">Ding et al. (2021)</xref> identified a significant causal relationship between the downloads and citations of articles in <italic>The Lancet</italic> using a vector autoregressive model. <xref ref-type="bibr" rid="r20">Schl&#x00F6;gl et al. (2014)</xref> calculated the correlation coefficients between downloads, citations, and readership data and found positive relationships. <xref ref-type="bibr" rid="r5">Congleton et al. (2022)</xref> examined the relationship between citations and downloads for a journal published by Springer Nature using ordinary least squares (OLS) and found that the two variables have positive impacts on each other. Thus, several studies have focused on the relationship between citations and downloads; few have examined the influence of factors other than citations on journal demand. This study explores the number of downloads as a variable representing journal demand, instead of the number of circulations, to identify the determinants of demand. As this study investigates only Springer Nature journals owing to limited data availability, the results cannot be generalized to overall academic journals. Nevertheless, findings on the distribution of downloads, including the Pareto principle, and the download determinants could provide guidance to libraries entering into subscription contracts with publishers.</p>
</sec> 
<sec id="s2">
<title>2. Methodology</title>
<p>This study targeted 1,485 hybrid journals published by Springer Nature in 2022. All journals were indexed in Scopus, and at least one open access article was published. In addition, this study constructed a download equation using OLS. OLS is the most basic regression model and determines the estimators that minimize the sum of squared residuals. The download equation is formulated as follows:</p>
<p><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/LIBER_2024_34_Assai_eq1.jpg"/></p>
<p>where <italic>ln</italic> represents the natural logarithm. Taking the natural logarithm of the variables allows us to interpret the estimators as elasticities of download to the respective variables. This study used <italic>Article</italic>, <italic>OpenRatio</italic>, <italic>CiteScore</italic>, <italic>Issue</italic>, <italic>Year</italic>, <italic>Society</italic>, and <italic>Academic Discipline</italic> as the independent variables in the <italic>Download</italic> equation. These independent variables were selected based on empirical studies exploring the determinants of subscription prices (<xref ref-type="bibr" rid="r4">Chressanthis &#x0026; Chressanthis, 1994</xref>; <xref ref-type="bibr" rid="r6">Coomes et al., 2017</xref>; <xref ref-type="bibr" rid="r17">Petersen, 1992</xref>). Previous studies found that downloads are concentrated around the time of article release, as mentioned in the Introduction. Therefore, in this study, <italic>Article</italic> represents the number of articles published in 2022, although readers can download articles published before 2022. <italic>OpenRatio</italic> refers to the proportion of open access articles to the number of articles in 2022, measured as a percentage. <italic>CiteScore</italic> for 2021 is the number of citations from 2018 to 2021 divided by the number of articles published during the same period. <italic>CiteScore</italic> for 2021 is the latest citation score available when downloading articles in 2022. <italic>Issue</italic> denotes the number of issues published in the year 2022. <italic>Year</italic> is the number of years since the journal&#x2019;s inception and is calculated by subtracting the inception year from 2022. Springer Nature publishes journals on behalf of academic societies, universities, and other research institutions, in addition to journals Springer Nature launched independently. When a journal is published on behalf of an academic society, university, or other research institution, Springer Nature provides information on the research institution that commissions publications on the journal&#x2019;s website. If this information was lacking, we deemed the journal to have been launched independently by Springer Nature. <italic>Society</italic> is set to 1 if the journal is published on behalf of an academic society, university, or other research institution, and 0 otherwise.</p>
<p>This study used the Scopus classification for academic disciplines, including <italic>Agriculture</italic>, <italic>Arts and humanities</italic>, <italic>Chemistry</italic>, <italic>Computer science</italic>, <italic>Earth and planetary science</italic>, <italic>Engineering</italic>, <italic>Environmental science</italic>, <italic>Material science</italic>, <italic>Mathematics</italic>, <italic>Medicine</italic>, <italic>Physics</italic>, and <italic>Social sciences</italic>. The variable <italic>Agriculture</italic> is set to 1 if the journal is in the agricultural and biological sciences, and 0 otherwise. Other variables representing academic disciplines are established similarly. As <italic>Material science</italic> formed the base group when formulating the <italic>Download</italic> equation, this variable does not appear in the equation. This study sourced the number of open and non-open access articles, CiteScores, and academic disciplines from Scopus in August 2023. The number of downloads, the year of the journal&#x2019;s inception, and whether the journal was published on behalf of the associated research institution were available from the journals&#x2019; websites in the same month.</p>
</sec> 
<sec id="s3">
<title>3. Findings</title>
<p>A total of 1,485 journals collectively published 225,595 articles in 2022, and these articles were downloaded a total of 505 million times. <xref ref-type="fig" rid="fg001">Figures 1</xref> and <xref ref-type="fig" rid="fg002">2</xref> depict the number of journal downloads, ranked in descending order. A small number of journals have a large number of downloads.</p>
<fig id="fg001">
<label>Fig. 1</label>
<caption><p>Number of downloads.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/LIBER_2024_34_Assai_fig1.jpg"/></fig>
<fig id="fg002">
<label>Fig. 2</label>
<caption><p>Cumulative number of downloads.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="figures/LIBER_2024_34_Assai_fig2.jpg"/></fig>
<p><xref ref-type="table" rid="tb001">Table 1</xref> presents the relationship between the proportions of journal titles and downloads ranked in descending order. Column (a) shows the distribution of the 1,485 journals across all disciplines, whereas column (b) focuses on 355 journals in medicine with the most journals. <xref ref-type="table" rid="tb001">Table 1</xref> (a) indicates that 5&#x0025; of the journals (74 of 1,485) account for 29.9&#x0025; of downloads. Additionally, 20&#x0025; of the journals (297 of 1,485) account for 60.8&#x0025; of downloads, which is less than 80&#x0025;. The results in column (b) are almost identical to those in column (a). The results demonstrate that the Pareto principle does not apply to hybrid journals published by Springer Nature, although downloads are concentrated in a small number of journals.</p>
<table-wrap id="tb001">
<label>Table 1</label>
<caption><p>Proportion of titles and downloads (&#x0025;).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Titles</th>
<th align="left" valign="top">(a) Total</th>
<th align="left" valign="top">(b) Medicine</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">5</td>
<td align="left" valign="top">29.9</td>
<td align="left" valign="top">27.8</td>
</tr>
<tr>
<td align="left" valign="top">10</td>
<td align="left" valign="top">43.6</td>
<td align="left" valign="top">40.1</td>
</tr>
<tr>
<td align="left" valign="top">20</td>
<td align="left" valign="top">60.8</td>
<td align="left" valign="top">56.1</td>
</tr>
<tr>
<td align="left" valign="top">30</td>
<td align="left" valign="top">72.1</td>
<td align="left" valign="top">68.3</td>
</tr>
<tr>
<td align="left" valign="top">40</td>
<td align="left" valign="top">80.4</td>
<td align="left" valign="top">77.2</td>
</tr>
<tr>
<td align="left" valign="top">50</td>
<td align="left" valign="top">86.4</td>
<td align="left" valign="top">84.3</td>
</tr>
<tr>
<td align="left" valign="top">60</td>
<td align="left" valign="top">91.1</td>
<td align="left" valign="top">89.7</td>
</tr>
<tr>
<td align="left" valign="top">70</td>
<td align="left" valign="top">94.7</td>
<td align="left" valign="top">93.8</td>
</tr>
<tr>
<td align="left" valign="top">80</td>
<td align="left" valign="top">97.3</td>
<td align="left" valign="top">96.7</td>
</tr>
<tr>
<td align="left" valign="top">90</td>
<td align="left" valign="top">99.1</td>
<td align="left" valign="top">98.8</td>
</tr>
<tr>
<td align="left" valign="top">100</td>
<td align="left" valign="top">100.0</td>
<td align="left" valign="top">100.0</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><xref ref-type="table" rid="tb002">Table 2</xref> outlines the statistics of the variables for the 1,485 hybrid journals, excluding the binary variables. The coefficients of variation for <italic>Download</italic> and <italic>Article</italic> indicate large variations. Moreover, the skewness values of the two variables are 8.32 and 12.00, respectively, demonstrating that the distributions have long right tails. The CiteScore mean is 4.4. Springer Nature publishes frequently cited journals. The mean <italic>OpenRatio</italic> is 23.2, implying that many articles in hybrid journals remain non-open access.</p>
<table-wrap id="tb002">
<label>Table 2</label>
<caption><p>Summary of statistics for variables.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"></th>
<th align="left" valign="top"><italic>Download</italic></th>
<th align="left" valign="top"><italic>Article</italic></th>
<th align="left" valign="top"><italic>OpenRatio</italic></th>
<th align="left" valign="top"><italic>CiteScore</italic></th>
<th align="left" valign="top"><italic>Issue</italic></th>
<th align="left" valign="top"><italic>Year</italic></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Mean</td>
<td align="left" valign="top">339,954</td>
<td align="left" valign="top">152</td>
<td align="left" valign="top">23.2</td>
<td align="left" valign="top">4.4</td>
<td align="left" valign="top">6.4</td>
<td align="left" valign="top">34.0</td>
</tr>
<tr>
<td align="left" valign="top">Median</td>
<td align="left" valign="top">179,356</td>
<td align="left" valign="top">83</td>
<td align="left" valign="top">20.4</td>
<td align="left" valign="top">3.9</td>
<td align="left" valign="top">6.0</td>
<td align="left" valign="top">29.0</td>
</tr>
<tr>
<td align="left" valign="top">SD</td>
<td align="left" valign="top">564,698</td>
<td align="left" valign="top">263</td>
<td align="left" valign="top">16.6</td>
<td align="left" valign="top">3.2</td>
<td align="left" valign="top">4.2</td>
<td align="left" valign="top">25.6</td>
</tr>
<tr>
<td align="left" valign="top">CV (&#x0025;)</td>
<td align="left" valign="top">166</td>
<td align="left" valign="top">173</td>
<td align="left" valign="top">71.5</td>
<td align="left" valign="top">72.2</td>
<td align="left" valign="top">65.0</td>
<td align="left" valign="top">75.4</td>
</tr>
<tr>
<td align="left" valign="top">Skewness</td>
<td align="left" valign="top">8.32</td>
<td align="left" valign="top">12.00</td>
<td align="left" valign="top">0.85</td>
<td align="left" valign="top">3.84</td>
<td align="left" valign="top">3.59</td>
<td align="left" valign="top">2.14</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>SD: standard deviation; CV: coefficient of variation; <italic>Download</italic>: number of downloads in 2022; <italic>Article</italic>: number of articles published in a journal in 2022; <italic>OpenRatio</italic>: proportion of open access articles to total articles published in a journal in 2022 (&#x0025;); <italic>CiteScore</italic>: CiteScore in 2021; <italic>Issue</italic>: number of issues in 2022; <italic>Year</italic>: number of years since the journal&#x2019;s inception.</p>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="tb003">Table 3</xref> reports the estimation results of <italic>Download</italic> using OLS. Columns (a) and (b) display the results for journals across all disciplines and medicine only, respectively. The coefficient of <italic>Article</italic> (0.6297) in column (a) is positive at the 1&#x0025; significance level, as expected. The coefficient of <italic>OpenRatio</italic> (0.2522) is also positive at the 1&#x0025; significance level. This result implies that open access articles are more frequently downloaded than non-open access articles, confirming the findings of prior studies (<xref ref-type="bibr" rid="r1">Asai, 2022</xref>; <xref ref-type="bibr" rid="r7">Davis, 2011</xref>; <xref ref-type="bibr" rid="r22">Wang et al., 2015</xref>).</p>
<table-wrap id="tb003">
<label>Table 3</label>
<caption><p>Estimation results.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2" align="left" valign="top">Variables</th>
<th colspan="2" align="left" valign="top">Coefficients<hr/></th>
</tr>
<tr>
<th align="left" valign="top">(a) Total</th>
<th align="left" valign="top">(b) Medicine</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><italic>Constant</italic></td>
<td align="left" valign="top">6.8471 (0.1330)***</td>
<td align="left" valign="top">7.6901 (0.1543)***</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Article</italic></td>
<td align="left" valign="top">0.6297 (0.0185)***</td>
<td align="left" valign="top">0.5486 (0.0321)***</td>
</tr>
<tr>
<td align="left" valign="top"><italic>OpenRatio</italic></td>
<td align="left" valign="top">0.2522 (0.0163)***</td>
<td align="left" valign="top">0.2259 (0.0344)***</td>
</tr>
<tr>
<td align="left" valign="top"><italic>CiteScore</italic></td>
<td align="left" valign="top">0.5172 (0.0242)***</td>
<td align="left" valign="top">0.5375 (0.0427)***</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Issue</italic></td>
<td align="left" valign="top">0.2036 (0.0328)***</td>
<td align="left" valign="top">0.2175 (0.0553)***</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Year</italic></td>
<td align="left" valign="top">0.1757 (0.0200)***</td>
<td align="left" valign="top">0.1583 (0.0396)**</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Society</italic></td>
<td align="left" valign="top">&#x2212;0.1198 (0.0283)***</td>
<td align="left" valign="top">&#x2212;0.1000 (0.0507)**</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Agriculture</italic></td>
<td align="left" valign="top">0.2453 (0.1060)**</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Arts and Humanities</italic></td>
<td align="left" valign="top">0.5039 (0.1161)***</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Chemistry</italic></td>
<td align="left" valign="top">0.2895 (0.1171)**</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Computer science</italic></td>
<td align="left" valign="top">&#x2212;0.0603 (0.1161)</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Earth and planetary science</italic></td>
<td align="left" valign="top">&#x2212;0.0492 (0.1182)</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Engineering</italic></td>
<td align="left" valign="top">&#x2212;0.0140 (0.1066)</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Environmental science</italic></td>
<td align="left" valign="top">0.1687 (0.1264)</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Mathematics</italic></td>
<td align="left" valign="top">&#x2212;0.3703 (0.1064)***</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Medicine</italic></td>
<td align="left" valign="top">0.3840 (0.1009)***</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Physics</italic></td>
<td align="left" valign="top">&#x2212;0.0708 (0.1239)</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Social sciences</italic></td>
<td align="left" valign="top">0.5277 (0.1034)***</td>
<td align="left" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">Adjusted R<sup>2</sup></td>
<td align="left" valign="top">0.7709</td>
<td align="left" valign="top">0.7899</td>
</tr>
<tr>
<td align="left" valign="top">Number of observations</td>
<td align="left" valign="top">1,485</td>
<td align="left" valign="top">355</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Standard errors are presented in parentheses.</p>
<p>***, **, and * denote significance at the 1&#x0025;, 5&#x0025;, and 10&#x0025; levels, respectively.</p>
</table-wrap-foot>
</table-wrap>
<p>The coefficient of <italic>CiteScore</italic> (0.5172) is positive at the 1&#x0025; significance level. The positive value suggests that citation score is one factor in selecting journals for reading. The coefficient of <italic>Year</italic> (0.1757) is positive at the 1&#x0025; significance level. This finding has two implications. First, journals with long publishing histories tend to have many loyal readers, leading to a large number of downloads. Second, articles published before 2022 can be downloaded in 2022. Journals that have existed for many years tend to publish numerous articles. Therefore, <italic>Year</italic> represents the number of downloadable articles in addition to journal popularity, although the two effects cannot be separated. The coefficient of <italic>Society</italic> is negative at the 1&#x0025; significance level. The findings indicate that journals published independently by Springer Nature have more downloads than those published on behalf of academic societies and other research institutions. For journals published independently by Springer Nature, the publisher may carefully assess demand before deciding whether to launch a journal. The coefficients of <italic>Arts and Humanities</italic>, <italic>Medicine</italic>, and <italic>Social sciences</italic> are positive at the 1&#x0025; significance level, indicating a large number of downloads in a journal. Conversely, the number of downloads in a mathematical journal is small due to the negative coefficient of <italic>Mathematics</italic>. The estimated results for medical journals are similar to those across all disciplines.</p>
</sec> 
<sec id="s4">
<title>4. Discussion and Conclusion</title>
<p><xref ref-type="table" rid="tb001">Table 1</xref> indicates a concentration of downloads in a small number of journals, although they do not follow the Pareto principle. It is reasonable to consider whether to continue the Big Deal contract by calculating the cost per download and comparing it with the pay-per-view and interlibrary loans (<xref ref-type="bibr" rid="r3">Chamberlain, 2022</xref>; <xref ref-type="bibr" rid="r16">Pedersen et al., 2014</xref>). On the one hand, a publisher&#x2019;s marginal costs for distributing an article are nearly zero. Even if publishers permit electronic access to all journals, the incremental costs of distribution are small. On the other hand, libraries that subscribe to individual journals tend to decrease the number of subscription journals owing to the increased subscription prices. However, as transformative agreements predetermine the payments during the contract period, publishers can maintain their revenues through transformative agreements. Moreover, libraries and researchers in the institutions may favor the bundling services of electronic journals based on the transformative agreements at present because the number of journals available increases significantly. However, if the prices specified in transformative agreements continue to increase, libraries will face a similar problem as experienced in Big Deal&#x2014;the difficulties in purchasing books and other publishers&#x2019; journals owing to the constrained library budgets other than the bundling services (<xref ref-type="bibr" rid="r10">Edlin &#x0026; Rubinfeld, 2004</xref>). Although the initial purpose of transformative agreements is to promote open access, bundling services would stiffen library budgets. Consequently, sales of other publishers without transformative agreements decline, and leading publishers with transformative agreements may enhance their market power in the academic journal market. Therefore, we should carefully monitor the prices of the transformative agreements and examine the usefulness of the bundling services.</p>
<p>Previous studies demonstrated that the number of downloads of open access articles was higher than that of non-open access articles (<xref ref-type="bibr" rid="r1">Asai, 2022</xref>; <xref ref-type="bibr" rid="r7">Davis, 2011</xref>; <xref ref-type="bibr" rid="r22">Wang et al., 2015</xref>). Our findings also indicate that open access increases downloads. Therefore, open access is beneficial for disseminating research. Regarding the 1,485 hybrid journals examined in this study, the proportion of open access articles ranged from 0.4&#x0025; to 88.9&#x0025;. Thus, the progress of open access varies across journals. Even if universities canceled subscriptions to hybrid journals with many open access articles, researchers in the institutions would not have great difficulty accessing the articles. Therefore, it is reasonable for university libraries to decide to subscribe to journals based on the level of progress of open access in addition to the fitness of the research domains and the frequency of citations.</p>
<p>This study has some limitations. Our findings cannot be generalized to overall journals, as this study investigated hybrid journals published only by Springer Nature. Second, it considered the distribution of journal demand without taking into account subscription payments because some research institutions purchase journals at unpublished prices that differ from list prices. The relationship between demand and price remains unclear, and more research is warranted.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="r1"><mixed-citation>Asai, S. (2022). Access patterns of electronic articles: The case study of Scientometrics. <italic>Publishing Research Quarterly</italic>, <italic>38</italic>(2), 295&#x2013;307. <ext-link ext-link-type="doi" xlink:href="10.1007/s12109-022-09887-0">https://doi.org/10.1007/s12109-022-09887-0</ext-link></mixed-citation></ref>
<ref id="r2"><mixed-citation>Bergstrom, T. C. (2001). Free labour for costly journals? <italic>Journal of Economic Perspectives</italic>, <italic>15</italic>(4), 183&#x2013;198. <ext-link ext-link-type="doi" xlink:href="10.1257/jep.15.4.183">https://doi.org/10.1257/jep.15.4.183</ext-link></mixed-citation></ref>
<ref id="r3"><mixed-citation>Chamberlain, S. (2022). Lessons learned from reevaluating Big Deals with Unsub. <italic>Serials Review</italic>, <italic>48</italic>(3&#x2013;4), 234&#x2013;237. <ext-link ext-link-type="doi" xlink:href="10.1080/00987913.2022.2132090">https://doi.org/10.1080/00987913.2022.2132090</ext-link></mixed-citation></ref>
<ref id="r4"><mixed-citation>Chressanthis, G. A., &#x0026; Chressanthis, J. D. (1994). The determinants of library subscription prices of the top-ranked economics journals: An econometric analysis. <italic>The Journal of Economic Education</italic>, <italic>25</italic>(4), 367&#x2013;382. <ext-link ext-link-type="doi" xlink:href="10.1080/00220485.1994.10844848">https://doi.org/10.1080/00220485.1994.10844848</ext-link></mixed-citation></ref>
<ref id="r5"><mixed-citation>Congleton, R. D., Marsella, A., &#x0026; Cardazzi, A. J. (2022). Readership and citations as alternative measures of impact. <italic>Constitutional Political Economy</italic>, <italic>33</italic>(1), 100&#x2013;114. <ext-link ext-link-type="doi" xlink:href="10.1007/s10602-021-09333-x">https://doi.org/10.1007/s10602-021-09333-x</ext-link></mixed-citation></ref>
<ref id="r6"><mixed-citation>Coomes, O. T., Moore, T. R., &#x0026; Breau, S. (2017). The price of journals in geography. <italic>The Professional Geographer</italic>, <italic>69</italic>(2), 251&#x2013;262. <ext-link ext-link-type="doi" xlink:href="10.1080/00330124.2016.1229624">https://doi.org/10.1080/00330124.2016.1229624</ext-link></mixed-citation></ref>
<ref id="r7"><mixed-citation>Davis, P. M. (2011). Open access, readership, citations: A randomized controlled trial of scientific journal publishing. <italic>The FASEB Journal: Official Publication of the Federation of American Societies for Experimental Biology</italic>, <italic>25</italic>(7), 2129&#x2013;2134. <ext-link ext-link-type="doi" xlink:href="10.1096/fj.11-183988">https://doi.org/10.1096/fj.11-183988</ext-link></mixed-citation></ref>
<ref id="r8"><mixed-citation>Davis, P. M. (2013). Public accessibility of biomedical articles from PubMed Central reduces journal readership &#x2013; Retrospective cohort analysis. <italic>The FASEB Journal</italic>, <italic>27</italic>(7), 2536&#x2013;2541. <ext-link ext-link-type="doi" xlink:href="10.1096/fj.13-229922">http://doi.org/10.1096/fj.13-229922</ext-link></mixed-citation></ref>
<ref id="r9"><mixed-citation>Ding, Y., Dong, X., Bu, Y., Zhang, B., Lin, K., &#x0026; Hu, B. (2021). Revisiting the relationship between downloads and citations: A perspective from papers with different citation patterns in the case of the Lancet. <italic>Scientometrics</italic>, <italic>126</italic>(9), 7609&#x2013;7621. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-021-04099-3">https://doi.org/10.1007/s11192-021-04099-3</ext-link></mixed-citation></ref>
<ref id="r10"><mixed-citation>Edlin, A. S., &#x0026; Rubinfeild, D. L. (2004). Exclusion or efficient pricing? The &#x2018;Big Deal&#x2019; bundling of academic journals. <italic>Antitrust Law Journal</italic>, <italic>72</italic>(1), 119&#x2013;158.</mixed-citation></ref>
<ref id="r11"><mixed-citation>Emrani, E., Moradi-Salari, A., &#x0026; Jamali, H. R. (2010). Usage data, e-journal selection, and negotiations: An Iranian consortium experience. <italic>Serials Review</italic>, <italic>36</italic>(2), 86&#x2013;92. <ext-link ext-link-type="doi" xlink:href="10.1080/00987913.2010.10765289">https://doi.org/10.1080/00987913.2010.10765289</ext-link></mixed-citation></ref>
<ref id="r12"><mixed-citation>Fern&#x00E1;ndez-Ramos, A., Rodr&#x00ED;guez-Bravo, B., &#x0026; Diez-Diez, &#x00C1;. (2023). Use of scientific journals in Spanish universities: Analysis of the relationship between citations and downloads in two university library consortia. <italic>Scientometrics</italic>, <italic>128</italic>(4), 2489&#x2013;2505. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-023-04670-0">https://doi.org/10.1007/s11192-023-04670-0</ext-link></mixed-citation></ref>
<ref id="r13"><mixed-citation>Loan, F. A., &#x0026; Mushtaq, R. (2023). Pareto 80/20 rule: A new principle for serial subscription. <italic>Serials Review</italic>, <italic>49</italic>(1&#x2013;2), 10&#x2013;14. <ext-link ext-link-type="doi" xlink:href="10.1080/00987913.2023.2179783">https://doi.org/10.1080/00987913.2023.2179783</ext-link></mixed-citation></ref>
<ref id="r14"><mixed-citation>Moed, H. F. (2005). Statistical relationships between downloads and citations at the level of individual documents within a single journal. <italic>Journal of the American Society for Information Science and Technology</italic>, <italic>56</italic>(10), 1088&#x2013;1097. <ext-link ext-link-type="doi" xlink:href="10.1002/asi.20200">https://doi.org/10.1002/asi.20200</ext-link></mixed-citation></ref>
<ref id="r15"><mixed-citation>Olsson, L., Lindel&#x00F6;w, C. H., &#x00D6;sterlund, L., &#x0026; Jakobsson, F. (2020). Swedish researchers&#x2019; responses to the cancellation of the big deal with Elsevier. <italic>Insights the UKSG Journal</italic>, <italic>33</italic>, Article 25. <ext-link ext-link-type="doi" xlink:href="10.1629/uksg.521">https://doi.org/10.1629/uksg.521</ext-link></mixed-citation></ref>
<ref id="r16"><mixed-citation>Pedersen, W. A., Arcand, J., &#x0026; Forbis, M. (2014). The Big Deal, interlibrary loan, and building the user-centered journal collection: A case study. <italic>Serials Review</italic>, <italic>40</italic>(4), 242&#x2013;250. <ext-link ext-link-type="doi" xlink:href="10.1080/00987913.2014.975650">https://doi.org/10.1080/00987913.2014.975650</ext-link></mixed-citation></ref>
<ref id="r17"><mixed-citation>Petersen, H. C. (1992). The economics of economics journals: A statistical analysis of pricing practices by publishers (Research Note). <italic>College and Research Libraries</italic>, <italic>53</italic>(2), 176&#x2013;181. <ext-link ext-link-type="doi" xlink:href="10.5860/crl_53_02_176">https://doi.org/10.5860/crl_53_02_176</ext-link></mixed-citation></ref>
<ref id="r18"><mixed-citation>Rodr&#x00ED;guez-Bravo, B., Fern&#x00E1;ndez-Ramos, A., De-la-Mano, M., &#x0026; Vianello-Osti, M. (2021) The evolution and revision of big deals: A review from the perspective of libraries. <italic>El Profesional de la Informaci&#x00F3;n</italic>, <italic>30</italic>(4), 1699&#x2013;2407. <ext-link ext-link-type="doi" xlink:href="10.3145/epi.2021.jul.15">https://doi.org/10.3145/epi.2021.jul.15</ext-link></mixed-citation></ref>
<ref id="r19"><mixed-citation>Schloegl, C., &#x0026; Gorraiz, J. (2010). Comparison of citation and usage indicators: The case of oncology journals. <italic>Scientometrics</italic>, <italic>82</italic>(3), 567&#x2013;580. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-010-0172-1">https://doi.org/10.1007/s11192-010-0172-1</ext-link></mixed-citation></ref>
<ref id="r20"><mixed-citation>Schl&#x00F6;gl, C., Gorraiz, J., Gumpenberger, C., Jack, K., &#x0026; Kraker, P. (2014). Comparison of downloads, citations and readership data for two information systems journals. <italic>Scientometrics</italic>, <italic>101</italic>(2), 1113&#x2013;1128. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-014-1365-9">https://doi.org/10.1007/s11192-014-1365-9</ext-link></mixed-citation></ref>
<ref id="r21"><mixed-citation>Vaughan, L., Tang, J., &#x0026; Yang, R. (2017). Investigating disciplinary differences in the relationships between citations and downloads. <italic>Scientometrics</italic>, <italic>111</italic>(3), 1533&#x2013;1545. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-017-2308-z">https://doi.org/10.1007/s11192-017-2308-z</ext-link></mixed-citation></ref>
<ref id="r22"><mixed-citation>Wang, X., Liu, C., Mao, W., &#x0026; Fang, Z. (2015). The open access advantage considering citation, article usage and social media attention. <italic>Scientometrics</italic>, <italic>103</italic>(2), 555&#x2013;564. <ext-link ext-link-type="doi" xlink:href="10.1007/s11192-015-1547-0">https://doi.org/10.1007/s11192-015-1547-0</ext-link></mixed-citation></ref>
<ref id="r23"><mixed-citation>Watson, A. B. (2007). Measuring demand for online articles at the <italic>Journal of Vision</italic>. <italic>Journal of Vision</italic>, <italic>7</italic>(7), 1&#x2013;3. <ext-link ext-link-type="doi" xlink:href="10.1167/7.7.i">https://doi.org/10.1167/7.7.i</ext-link></mixed-citation></ref>
<ref id="r24"><mixed-citation>Zhu, Q., &#x0026; Xiang, H. (2016). Differences of Pareto principle performance in e-resource download distribution: An empirical study. <italic>The Electronic Library</italic>, <italic>34</italic>(5), 846&#x2013;855. <ext-link ext-link-type="doi" xlink:href="10.1108/el-05-2015-0068">https://doi.org/10.1108/EL-05-2015-0068</ext-link></mixed-citation></ref>
</ref-list>
</back>
</article>