root/google/scholar.google.com/scholar/publishers.html

Revision 61, 13.7 kB (checked in by Myc, 9 years ago)

Removed system message that image links are not monitored.

From now on, image references in documents are simply discarded
without explicit notice in the document representations (i.e.,
GooDiff's "stripped down" version of the original documents).

Line 
1 [ ](http://scholar.google.com) 
2 ** Support for Scholarly Publishers**
3  
4
5 [About Google Scholar](about.html)
6
7 [Google Scholar Help](help.html)
8
9 [Advanced Search Tips](refinesearch.html)
10
11 [Support for Libraries](libraries.html)
12
13 **Support for Publishers**
14
15 [Add Google Scholar 
16 to your site](https://services.google.com/inquiry/scholar_email?hl=en)
17
18  
19
20
21 Find library resources with [Library Links](librarylinks.html)
22
23 Google and Google Scholar can boost the worldwide visibility and accessibility of your content. We work with publishers of scholarly information to index peer-reviewed papers, theses, preprints, abstracts, and technical reports from all disciplines of research and make them searchable on Google and Google Scholar. This page provides policy and technical information for scholarly publishers and societies.
24
25 ** Publisher Policies**
26  
27 **Multiple versions of a work are grouped to improve its ranking.** In many research areas versions of a work may appear as preprints and conference papers before being published as a journal article. These preliminary versions of a work are often cited in addition to the authoritative journal version. The number of citations to a particular work is an important part of determining its rank in the Google Scholar search results. Grouping versions allows us to collect all citations to all versions of a work. In practice, this can significantly improve the position of an article in the search results.
28
29 **Publisher's full-text, if indexed, is the primary version.** When multiple versions of a work are indexed, we select the full and authoritative text from the publisher as the primary version. We can only do this if we are able to successfully identify, crawl and process the full text of the publisher's version.
30
31 **Publishers have control over access to their articles.** We work with publishers to preserve their control over access to their content and only cache articles and papers that are not access-controlled. Publishers can help us by identifying which regions of their sites are access-controlled. For details, please [click here](http://www.google.com/remove.html#uncache).
32
33 ** Google users must be offered at least a complete abstract.** This is a crucial component of our indexing program. For papers with access restrictions, a full author-written abstract will help users choose among the results which paper is the most likely to have the information they are looking for.   
34  
35 **We will respond to complaints regarding copyright infringement. **Our policy is to  respond to all notices of alleged copyright infringement that comply with the  Digital Millennium Copyright Act. For directions and more  information, please [click here](http://www.google.com/dmca.html).
36
37 ** Frequently Asked Questions**
38
39 **Common Questions**
40
41   1. [I'm a publisher of scholarly works and would like to have my content included in Google and Google Scholar?](#faq1)
42   2. [What are the benefits of including my content in Google and Google Scholar?](#faq2)
43   3. [My content is available to subscribers only. Can you include it in Google Scholar?](#faq3)
44   4. [Only the abstracts for my articles are online. Can you index them?](#faq4)
45   5. [I publish magazines. Can my articles be part of Google Scholar?](#faq5)
46   6. [I publish scholarly textbooks and monographs. Can my content be included in Google Scholar?](#faq6)
47   7. [I represent a professional society. Can you index our research articles?](#faq7)
48   8. [Can you index my school's dissertations and technical reports?](#faq8)
49   9. [Can I see usage statistics for my content?](#faq9)
50   10. [Do you charge for referrals from Google Scholar search results?](#faq10)
51   11. [What do I do if I believe you're linking to a webpage that infringes my copyright?](#faq11)
52
53 **Technical Questions**
54
55   1. [My articles are in PDF format. Can you still index my site?](#tech1)
56   2. [How can I tell if a PDF file has searchable text?](#tech2)
57   3. [Many of my articles are PDFs with scanned images. Can you index these?](#tech3)
58   4. [Some of my articles are split into multiple files, one file per section. Can you work with these?](#tech4)
59   5. [I generate a single PDF file for each issue of my publication. This file includes many articles. Can this be included in Google Scholar?](#tech5)
60   6. [Articles in PDF format can be large, and it's easy for me to extract the text from each article. Can you work with just the sequence of words for each document?](#tech6)
61   7. [You're crawling the full text of my work, but I still don't see that as the first link.](#tech7)
62   8. [I see a 'cached' (or 'View as HTML') link for my access-controlled articles. I need to have this fixed right away!](#tech8)
63   9. [How do you order query results?](#tech9)
64   10. [Is there anything I can do to help rank my articles better?](#tech10)
65   11. [All my articles are available to your crawlers, but not all of them seem to show up in Google Scholar. Can I do something to help improve coverage?](#tech11)
66   12. [I have some questions about how the Google Scholar crawler works. Where should I look?](#tech12)
67
68  
69
70
71 **Common Questions**
72
73   1. **I'm a publisher of scholarly works and would like to have my content included in Google and Google Scholar? 
74  
75 **Your content is most welcome. If your works are already online, we may need nothing more than your permission for our crawlers to visit your site. As noted above, an abstract (at least) of each work must be available to non-subscribers who come from Google and Google Scholar. Please [contact us](http://www.google.com/support/scholar/bin/request.py) to discuss the details. 
76  
77   2. **What are the benefits of including my content in Google and Google Scholar? 
78  
79 **Google and Google Scholar can boost the worldwide visibility and accessibility of your content. Our experience shows that scholarly publishers who make their work available to Google search services usually see a significant increase in traffic. Similar results have been reported at several publishing conferences and workshops. 
80  
81   3. **My content is available to subscribers only. Can you include it in Google Scholar? 
82  
83 **Absolutely, as long as you can show a complete abstract (or more) to all users who arrive from Google and Google Scholar. In this way you can let web searchers know your work exists, while still limiting full access to subscribers. Please [contact us](http://www.google.com/support/scholar/bin/request.py) for details. 
84  
85   4. **Only the abstracts for my articles are online. Can you index them? 
86  
87 **We strongly recommend indexing full-text versions of your work if you want the most powerful results that Google can provide. But we do work with abstracts, as long as all visitors from Google and Google Scholar can see these abstracts in their entirety.   
88  
89   5. **I publish magazines. Can my articles be part of Google Scholar? 
90  
91 **Google Scholar focuses on scholarly content. We are unable to include popular or trade magazines. If you believe your magazines are suitable primarily for a scholarly audience, please [contact us](http://www.google.com/support/scholar/bin/request.py). 
92  
93   6. **I publish scholarly textbooks and monographs. Can my content be included in Google Scholar? 
94  
95 **For now, Google Scholar indexes only scholarly articles. For textbooks and monographs, we recommend [Google Book Search](http://books.google.com).   
96  
97   7. **I represent a professional society. Can you index our research articles?   
98  
99 **Yes, as long as our robot software is able to crawl them online. If you don't host the content yourself, you'll need to ask your hosting provider to allow access to our crawlers. You may find we're already working with them to make content available, but it helps if you make it known you'd like this to happen. Please [contact us](http://www.google.com/support/scholar/bin/request.py) for more details.   
100  
101   8. **Can you index my school's dissertations and technical reports? 
102  
103 **Yes, as long as our robot software is able to crawl them online. Please ask the administrator of your school or department to [contact us](http://www.google.com/support/scholar/bin/request.py). 
104  
105   9. **Can I see usage statistics for my content?   
106  
107 **Since users click through to your website, your web server logs should have all the usage statistics.   
108  
109   10. **Do you charge for referrals from Google Scholar search results? 
110  
111 **Absolutely not. We have no plans to charge for Google Scholar referrals. 
112  
113   11. **What do I do if I believe you're linking to a webpage that infringes my copyright?   
114  
115 **It is our policy to respond to notices of alleged infringement that comply with the Digital Millennium Copyright Act. For directions and more information, please [click here](http://www.google.com/dmca.html).
116
117 **Technical Questions**
118
119   1. **My articles are in PDF format. Can you still index my site? 
120  
121 **Yes. We can index PDF articles as long as they're searchable. We also can index HTML, PostScript, compressed PostScript (ps.gz), and compressed PDF (pdf.gz). 
122  
123   2. **How can I tell if a PDF file has searchable text?** 
124  
125 Open the file in Adobe Acrobat Reader. Click 'Find' (look for the binocular icon), and confirm that you can search for and find several words on the page.   
126  
127   3. **Many of my articles are PDFs with scanned images. Can you index these? 
128  
129 **Possibly. We're glad to index multi-layer PDFs that have images and text in OCR format. (OCR stands for optical character recognition.) We can't index PDFs with just scanned images. If you have many scanned PDFs that lack OCR text, please [contact us](http://www.google.com/support/scholar/bin/request.py).   
130  
131   4. **Some of my articles are split into multiple files, one file per section. Can you work with these? 
132  
133 **Alas, we can't. We can index only one file per article at the moment. 
134  
135   5. **I generate a single PDF file for each issue of my publication. This file includes many articles. Can this be included in Google Scholar? 
136  
137 **Sorry, not at this time. We support only one article per file. 
138  
139   6. **Articles in PDF format can be large, and it's easy for me to extract the text from each article. Can you work with just the sequence of words for each document?   
140  
141 **We strongly recommend preserving the full PDF layout information. We rely on a document's layout to extract metadata, citations and other information which plays a significant role in relevance ranking. If document size or crawl bandwidth are issues, we can work with you to determine a suitable way to crawl your site. Please [contact us](http://www.google.com/support/scholar/bin/request.py).   
142  
143   7. **You're crawling the full text of my work, but I still don't see that as the first link? 
144  
145 **Rest assured, this is not intentional, and may be due to good old-fashioned technical difficulties. For instance, we may be unable to crawl your version, or to extract key information like author and title. If you'll [email us](mailto:scholar-publisher@google.com) with a sample query and specific results, we'll be glad to investigate.   
146  
147   8. **I see a 'cached' (or 'View as HTML') link to my access-controlled articles. I need to have this fixed right away!** 
148  
149 Of course! Please email us with specific examples of where the links appear; we'll investigate and fix as soon as possible. This is not intentional but may happen due to technical issues. For example, our methodical crawlers may accidentally discover a forgotten alternative interface to your content. You'll need to tell us of all such interfaces, because crawlers can go places where you least expect them. Please [email us](mailto:scholar-publisher@google.com) and we'll look into it.   
150  
151 If you believe another site is infringing your copyright, please see our directions on the [DMCA process](http://www.google.com/dmca.html). 
152  
153   9. **How do you order query results? 
154 **   
155 We take variety of factors into account for ordering search results. For more information, please see the Google Scholar [help page](help.html).   
156  
157   10. **Is there anything I can do to help rank my articles better? 
158  
159 **Indeed you can. Our indexing algorithms automatically extract metadata, citations and other information from articles and use them for ranking purposes. Providing authoritative metadata about your articles can help facilitate this and can increase the likelihood of identifying all the citations to your articles. We strongly recommend this approach. Please [contact us](mailto:scholar-publisher@google.com) if you would like to work with us on this. 
160  
161   11. **All my articles are available to your crawlers, but not all of them seem to show up in Google Scholar. Can I do something to help improve coverage? 
162  
163 **Based on our experience, here are some suggestions:   
164
165
166     * Make sure all your articles can be reached from your home page by following simple HTML links. Building a browse interface for your site can help, and also can help users discover the full richness of your site.
167     * Avoid using session IDs, cookies and other tracking parameters for our crawler. These provide useful information for users but hinder crawler operation, since multiple URLs are associated with each document.
168     * Provide us with a list of URLs for all your scholarly articles, along with article-level metadata for each. This facilitates both crawling and indexing.
169
170  
171  
172   12. **I have some questions about how the Google Scholar crawler works. Where should I look? 
173  
174 **The Google Scholar crawler works the same way as the main Google crawler. You can find more information about Google's crawlers as well as guidelines for webmasters on the main Google web site. 
175  
176 For more details on any of these points, please don't hesitate to [contact us](mailto:scholar-publisher@google.com). 
177
178
179    
180
181 * * *
182
183 (c)2005 Google - [Google Home](http://www.google.com) - [About Google](http://www.google.com/about.html) - [Privacy Policy](http://www.google.com/privacy.html)
184
Note: See TracBrowser for help on using the browser.