Objective of this assignment is to extract some sections (which are mentioned below) from SEC / EDGAR financial reports and perform text analysis to compute variables those are explained below. Link to SEC / EDGAR financial reports are given in excel spreadsheet “cik_list.xlsx”.
- pandas
- urllib
- re
- BeautifulSoup
Put the following files in a same folder:
- cik_list.xlsx
- constraining_dictionary.xlsx
- LoughranMcDonald_MasterDictionary_2016.csv
- StopWords_Generic.txt
- uncertainty_dictionary.xlsx
- BlackcofferAssignment.py
Now run the BlackcofferAssignment.py file with the required packages with an active internet connection. Wait some time as the program is going to fetch the large number of files and perform operations on them. The output file will be generated.