# eScholarship at UC Berkeley

![escholarship_logo](./images/escholarship_logo.png)

**eScholarship Repository** offers preservation and dissemination services for a wide range of scholarship including working papers, electronic theses and dissertations (ETDs), student capstone projects, and paper/seminar series. eScholarship Repository is also the primary destination for researchers depositing their previously published journal articles in accordance with the Academic Senate’s UC Open Access Policy.

[https://escholarship.org/uc/ucb/about](https://escholarship.org/uc/ucb/about)

To deposit your work as a UC Berkeley affiliate, click on the "Deposit" button and follow the instructions according to your affiliation.

![Deposit_by_affiliation](./images/deposit.png)

In [14]:
!pip3 install requests_html



In [15]:
import requests
import pandas as pd
from requests_html import HTMLSession

In [16]:
def get_source(url):
    """Return the source code for the provided URL. 

    Args: 
        url (string): URL of the page to scrape.

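    Returns:
        response (object): HTTP response object from requests_html,
            or None if the request fails.
    """

    try:
        session = HTMLSession()
        response = session.get(url)
        return response

    except requests.exceptions.RequestException as e:
        # Report the error and return None explicitly so callers can check for it.
        print(e)
        return None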

In [17]:
url = "https://escholarship.org/rss/unit/ucb_postprints"

In [98]:
def get_feed(url):
    """Return a Pandas dataframe containing the RSS feed contents.

    Args: 
        url (string): URL of the RSS feed to read.

    Returns:
        df (dataframe): Pandas dataframe containing the RSS feed contents.
    """
    
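    response = get_source(url)

    # get_source() returns None when the request fails; return an empty frame.
    if response is None:
        return pd.DataFrame(columns=['title', 'date', 'guid'])

    rows = []

    with response as r:
        items = r.html.find("item", first=False)

        for item in items:
            # The feed is parsed as HTML here, so <pubDate> is matched as 'pubdate'.
            title = item.find('title', first=True).text
            guid = item.find('guid', first=True).text
            date = item.find('pubdate', first=True).text

            rows.append({'title': title, 'date': date, 'guid': guid})

    # Build the frame once; DataFrame.append was removed in pandas 2.0.
    return pd.DataFrame(rows, columns=['title', 'date', 'guid'])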
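
Because an RSS feed is XML, the same extraction can also be sketched with the standard library's `xml.etree.ElementTree` rather than the HTML parser in `requests_html`. This is a minimal sketch under stated assumptions, not the method used in this notebook: `parse_rss_items` and `SAMPLE_FEED` are hypothetical names, and the sample feed is made-up placeholder data, not real eScholarship output. An XML parser is case-sensitive, so the date element is looked up as `pubDate`.

```python
import xml.etree.ElementTree as ET

# Made-up placeholder feed for illustration only (not real eScholarship data).
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>Example title A</title>
      <guid>https://example.org/item/a</guid>
      <pubDate>Wed, 30 Jun 2021 00:00:00 +0000</pubDate>
    </item>
    <item>
      <title>Example title B</title>
      <guid>https://example.org/item/b</guid>
      <pubDate>Mon, 28 Jun 2021 00:00:00 +0000</pubDate>
    </item>
  </channel>
</rss>
"""


def parse_rss_items(xml_text):
    """Return a list of dicts with the title, date, and guid of each <item>."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.iter("item"):
        rows.append({
            "title": item.findtext("title", default=""),
            "date": item.findtext("pubDate", default=""),
            "guid": item.findtext("guid", default=""),
        })
    return rows


rows = parse_rss_items(SAMPLE_FEED)
for row in rows:
    print(row["date"], row["title"], row["guid"])
```

The resulting list of dictionaries can be passed to `pd.DataFrame(rows)` to get the same three columns as `get_feed`.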

In [117]:
df = get_feed(url)
df.head(10)

| | title | date | guid |
|---|---|---|---|
| 0 | Capturing the Complex Histories of German Worl... | Wed, 30 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/48q30649 |
| 1 | A Co(II)-based Molecular Magnet with a 6 T Coe... | Mon, 28 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/7tt0n82b |
| 2 | Background-free dual-mode optical and <sup>13<... | Mon, 28 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/2q38g0g5 |
| 3 | Actinide tetra-N-heterocyclic carbene 'sandwic... | Thu, 24 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/2j8503k9 |
| 4 | Moat: Verifying confidentiality of enclave pro... | Thu, 24 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/0qg1g2nm |
| 5 | Observing and Modeling the Sequential Pairwise... | Thu, 24 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/0mt7m53c |
| 6 | At the intersection of immigration and welfare... | Wed, 23 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/7dp0m5v3 |
| 7 | Does State Unemployment Insurance Modernizatio... | Wed, 23 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/6fh4231d |
| 8 | Sterile neutrino dark matter and leptogenesis ... | Tue, 22 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/9h95x481 |
| 9 | Predictions for axion couplings from ALP cogen... | Tue, 22 Jun 2021 00:00:00 +0000 | https://escholarship.org/uc/item/436453p7 |


In [118]:
df['date'] = df['date'].str.slice(stop=16)
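
The slice keeps the first 16 characters of each timestamp, which for the values shown above is exactly the `Wed, 30 Jun 2021` part. As a hedged alternative that does not depend on character positions (for example, if a feed were to emit a single-digit day), the timestamp can be parsed with the standard library and then reformatted. `short_date` is a hypothetical helper name, and the weekday and month abbreviations assume the default English (C) locale.

```python
from email.utils import parsedate_to_datetime


def short_date(timestamp):
    """Parse an RFC 822 style timestamp and return e.g. 'Wed, 30 Jun 2021'."""
    parsed = parsedate_to_datetime(timestamp)
    return parsed.strftime("%a, %d %b %Y")


stamp = "Wed, 30 Jun 2021 00:00:00 +0000"
print(short_date(stamp))  # Wed, 30 Jun 2021
print(stamp[:16])         # Wed, 30 Jun 2021 (same result as the slice)
```

Applied to the frame, this would read `df['date'] = df['date'].map(short_date)` in place of the slice.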

## Ten Most Recent Postprints in eScholarship

In [119]:
df = df.rename(columns={'title': 'Title', 'date': 'Date', 'guid': 'Link'})
df.head(10)

| | Title | Date | Link |
|---|---|---|---|
| 0 | Capturing the Complex Histories of German Worl... | Wed, 30 Jun 2021 | https://escholarship.org/uc/item/48q30649 |
| 1 | A Co(II)-based Molecular Magnet with a 6 T Coe... | Mon, 28 Jun 2021 | https://escholarship.org/uc/item/7tt0n82b |
| 2 | Background-free dual-mode optical and <sup>13<... | Mon, 28 Jun 2021 | https://escholarship.org/uc/item/2q38g0g5 |
| 3 | Actinide tetra-N-heterocyclic carbene 'sandwic... | Thu, 24 Jun 2021 | https://escholarship.org/uc/item/2j8503k9 |
| 4 | Moat: Verifying confidentiality of enclave pro... | Thu, 24 Jun 2021 | https://escholarship.org/uc/item/0qg1g2nm |
| 5 | Observing and Modeling the Sequential Pairwise... | Thu, 24 Jun 2021 | https://escholarship.org/uc/item/0mt7m53c |
| 6 | At the intersection of immigration and welfare... | Wed, 23 Jun 2021 | https://escholarship.org/uc/item/7dp0m5v3 |
| 7 | Does State Unemployment Insurance Modernizatio... | Wed, 23 Jun 2021 | https://escholarship.org/uc/item/6fh4231d |
| 8 | Sterile neutrino dark matter and leptogenesis ... | Tue, 22 Jun 2021 | https://escholarship.org/uc/item/9h95x481 |
| 9 | Predictions for axion couplings from ALP cogen... | Tue, 22 Jun 2021 | https://escholarship.org/uc/item/436453p7 |
