{"id":204,"date":"2016-05-07T19:42:22","date_gmt":"2016-05-07T19:42:22","guid":{"rendered":"http:\/\/blogs.softwareclue.com\/?p=204"},"modified":"2016-05-07T19:42:22","modified_gmt":"2016-05-07T19:42:22","slug":"scrape-google-scholar","status":"publish","type":"post","link":"http:\/\/blog.softwareclues.com\/zh\/scrape-google-scholar","title":{"rendered":"Scrape Google Scholar"},"content":{"rendered":"<p>Source: <a href=\"http:\/\/lernpython.de\/scrape-google-scholar\" target=\"_blank\">http:\/\/lernpython.de\/scrape-google-scholar<\/a><\/p>\n<p><strong>Google Scholar is a useful application. It links every publication to its authors and gives easy access to the scientific output of every researcher. Two important key indicators are the number of citations and the <a href=\"https:\/\/en.wikipedia.org\/wiki\/H-index\" target=\"_blank\">H-Index<\/a>. This short script shows how to extract\/scrape these two parameters in Python.<\/strong><\/p>\n<p><a href=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-297 size-full\" src=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\" sizes=\"(max-width: 510px) 100vw, 510px\" srcset=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=300%2C198 300w, http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?w=510 510w\" alt=\"hindex VS citations scrape Google Scholar\" width=\"510\" height=\"337\" \/><\/a><\/p>\n<p>To scrape Google Scholar, we first load the libraries needed for this task and define a function that scrapes the H-Index from a Google Scholar profile, provided we pass it the link to this profile. 
Given that link, the function returns the H-Index.<\/p>\n<div id=\"WpAceEditor_1\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<div id=\"WpAceEditor_1\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<h3>Use Scholarly to scrape Google Scholar<\/h3>\n<p>In the next step we use the Python module <a href=\"https:\/\/pypi.python.org\/pypi\/scholarly\/0.1.3\">scholarly<\/a>. It has several features. The most important is that it can search the Google Scholar database for names and return their number of citations or the direct link to the Google Scholar profile. Hence, we give it a list of scientists in the field of nanopores and use it to get the number of citations and the link to each Google Scholar profile. This link is then fed to the previously defined function to return the H-Index.<\/p>\n<div id=\"WpAceEditor_2\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<div id=\"WpAceEditor_2\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<p>We save the H-Index, number of citations and researcher name into one list and plot the two integer parameters against each other.<\/p>\n<div id=\"WpAceEditor_3\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<div id=\"WpAceEditor_3\" class=\" ace_editor ace-monokai ace_dark\"><\/div>\n<p>The result is a plot with the number of citations on the X-axis and the H-Index on the Y-axis. From this we can see that the H-Index grows as the number of citations increases. 
A more detailed analysis of citation behavior can be found <a href=\"https:\/\/peerj.com\/articles\/183\/\" target=\"_blank\">here<\/a>.<\/p>\n<p><a href=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-297\" src=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\" sizes=\"(max-width: 510px) 100vw, 510px\" srcset=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=300%2C198 300w, http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?w=510 510w\" alt=\"hindex VS citations scrape Google Scholar\" width=\"510\" height=\"337\" \/><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Source: http:\/\/lernpython.de\/scrape-google-scholar Goog &hellip; <a href=\"http:\/\/blog.softwareclues.com\/zh\/scrape-google-scholar\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\">\u201cScrape Google 
Scholar\u201d<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[3],"tags":[65,62,12],"translation":{"provider":"WPGlobus","version":"2.12.2","language":"zh","enabled_languages":["en","zh"],"languages":{"en":{"title":true,"content":true,"excerpt":false},"zh":{"title":true,"content":false,"excerpt":false}}},"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Scrape Google Scholar - \u8f6f\u4ef6\u542f\u793a\u5f55<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/blog.softwareclues.com\/scrape-google-scholar\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Scrape Google Scholar - \u8f6f\u4ef6\u542f\u793a\u5f55\" \/>\n<meta property=\"og:url\" content=\"http:\/\/blog.softwareclues.com\/scrape-google-scholar\" \/>\n<meta property=\"og:site_name\" content=\"\u8f6f\u4ef6\u542f\u793a\u5f55\" \/>\n<meta property=\"article:published_time\" content=\"2016-05-07T19:42:22+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\" \/>\n<meta 
name=\"author\" content=\"Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar\",\"url\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar\",\"name\":\"Scrape Google Scholar - \u8f6f\u4ef6\u542f\u793a\u5f55\",\"isPartOf\":{\"@id\":\"http:\/\/blog.softwareclues.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage\"},\"image\":{\"@id\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage\"},\"thumbnailUrl\":\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\",\"datePublished\":\"2016-05-07T19:42:22+00:00\",\"dateModified\":\"2016-05-07T19:42:22+00:00\",\"author\":{\"@id\":\"http:\/\/blog.softwareclues.com\/#\/schema\/person\/4c47e4e97a658930b6c0e90f4a4eda82\"},\"breadcrumb\":{\"@id\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar#breadcrumb\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/blog.softwareclues.com\/scrape-google-scholar\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage\",\"url\":\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\",\"contentUrl\":\"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"http:\/\/blog.softwar
eclues.com\/scrape-google-scholar#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"http:\/\/blog.softwareclues.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Scrape Google Scholar\"}]},{\"@type\":\"WebSite\",\"@id\":\"http:\/\/blog.softwareclues.com\/#website\",\"url\":\"http:\/\/blog.softwareclues.com\/\",\"name\":\"\u8f6f\u4ef6\u542f\u793a\u5f55\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"http:\/\/blog.softwareclues.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"http:\/\/blog.softwareclues.com\/#\/schema\/person\/4c47e4e97a658930b6c0e90f4a4eda82\",\"name\":\"Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"http:\/\/blog.softwareclues.com\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/2.gravatar.com\/avatar\/e4fb391d9f5bb29583ed9579324a5e17?s=96&d=mystery&r=g\",\"contentUrl\":\"http:\/\/2.gravatar.com\/avatar\/e4fb391d9f5bb29583ed9579324a5e17?s=96&d=mystery&r=g\",\"caption\":\"Editorial Team\"},\"url\":\"http:\/\/blog.softwareclues.com\/zh\/author\/admin\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Scrape Google Scholar - \u8f6f\u4ef6\u542f\u793a\u5f55","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/blog.softwareclues.com\/scrape-google-scholar","og_locale":"zh_CN","og_type":"article","og_title":"Scrape Google Scholar - \u8f6f\u4ef6\u542f\u793a\u5f55","og_url":"http:\/\/blog.softwareclues.com\/scrape-google-scholar","og_site_name":"\u8f6f\u4ef6\u542f\u793a\u5f55","article_published_time":"2016-05-07T19:42:22+00:00","og_image":[{"url":"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337"}],"author":"Editorial Team","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"Editorial Team","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"1 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar","url":"http:\/\/blog.softwareclues.com\/scrape-google-scholar","name":"Scrape Google Scholar - 
\u8f6f\u4ef6\u542f\u793a\u5f55","isPartOf":{"@id":"http:\/\/blog.softwareclues.com\/#website"},"primaryImageOfPage":{"@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage"},"image":{"@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage"},"thumbnailUrl":"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337","datePublished":"2016-05-07T19:42:22+00:00","dateModified":"2016-05-07T19:42:22+00:00","author":{"@id":"http:\/\/blog.softwareclues.com\/#\/schema\/person\/4c47e4e97a658930b6c0e90f4a4eda82"},"breadcrumb":{"@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar#breadcrumb"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["http:\/\/blog.softwareclues.com\/scrape-google-scholar"]}]},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar#primaryimage","url":"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337","contentUrl":"http:\/\/i2.wp.com\/lernpython.de\/wp-content\/uploads\/2015\/06\/hindexVScitations.png?resize=510%2C337"},{"@type":"BreadcrumbList","@id":"http:\/\/blog.softwareclues.com\/scrape-google-scholar#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"http:\/\/blog.softwareclues.com\/"},{"@type":"ListItem","position":2,"name":"Scrape Google Scholar"}]},{"@type":"WebSite","@id":"http:\/\/blog.softwareclues.com\/#website","url":"http:\/\/blog.softwareclues.com\/","name":"\u8f6f\u4ef6\u542f\u793a\u5f55","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"http:\/\/blog.softwareclues.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"zh-Hans"},{"@type":"Person","@id":"http:\/\/blog.softwareclues.com\/#\/schema\/person\/4c47e4e97a658930b6c0e90f4a4eda82","name":"Editorial 
Team","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"http:\/\/blog.softwareclues.com\/#\/schema\/person\/image\/","url":"http:\/\/2.gravatar.com\/avatar\/e4fb391d9f5bb29583ed9579324a5e17?s=96&d=mystery&r=g","contentUrl":"http:\/\/2.gravatar.com\/avatar\/e4fb391d9f5bb29583ed9579324a5e17?s=96&d=mystery&r=g","caption":"Editorial Team"},"url":"http:\/\/blog.softwareclues.com\/zh\/author\/admin"}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paLJfj-3i","jetpack-related-posts":[],"_links":{"self":[{"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/posts\/204"}],"collection":[{"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/comments?post=204"}],"version-history":[{"count":2,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/posts\/204\/revisions"}],"predecessor-version":[{"id":206,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/posts\/204\/revisions\/206"}],"wp:attachment":[{"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/media?parent=204"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/categories?post=204"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/blog.softwareclues.com\/zh\/wp-json\/wp\/v2\/tags?post=204"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}