How exactly is this used #2

Open
ccurelea opened this issue May 9, 2020 · 4 comments
Comments

ccurelea commented May 9, 2020

Are the Params the input to the search page?

Then does the script just download all of the pdfs that appear?

Author

ccurelea commented May 9, 2020

What is the significance of INDUSTRIES = '046,047,005,006,058,025'?
Can't this just be blank?

Thanks

Owner

pudo commented May 10, 2020

Hey! So to answer both questions: yes, the PARAMS are what is needed to fake the main search form. My use case in developing this was to download only filings from companies in the extractive industries sector, which is what the industry codes listed in INDUSTRIES are. You could try and leave this empty, or make it explicitly iterate over every industry code in the site dropdown if you want to download all documents.

That will take half an eternity, however.
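
Iterating over every industry code, as suggested above, could be sketched roughly as follows. This is a minimal illustration and not the script's actual code: the SAMPLE_FORM markup, the industry_group field name, and the OptionCollector and industry_codes helpers are all hypothetical, and the real dropdown on the site may be structured differently.

```python
from html.parser import HTMLParser

# Hypothetical stand-in for the search form's industry dropdown.
SAMPLE_FORM = """
<select name="industry_group">
  <option value="">All</option>
  <option value="005">Example industry A</option>
  <option value="046">Example industry B</option>
</select>
"""


class OptionCollector(HTMLParser):
    """Collect non-empty value attributes of <option> tags in one <select>."""

    def __init__(self, select_name):
        super().__init__()
        self.select_name = select_name
        self.in_select = False
        self.values = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "select" and attrs.get("name") == self.select_name:
            self.in_select = True
        elif tag == "option" and self.in_select and attrs.get("value"):
            # Skip the empty "All" entry; keep only real codes.
            self.values.append(attrs["value"])

    def handle_endtag(self, tag):
        if tag == "select":
            self.in_select = False


def industry_codes(html, select_name="industry_group"):
    """Return the list of option values found in the named dropdown."""
    parser = OptionCollector(select_name)
    parser.feed(html)
    return parser.values


codes = industry_codes(SAMPLE_FORM)
# One search per code; each would take the place of the hard-coded INDUSTRIES value.
for code in codes:
    print("would search with INDUSTRIES =", code)
```

Running one search per code keeps each request small, at the cost of many more requests overall, which is where the long runtime comes from.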

@ccurelea
Author

Thanks for the answer! Really cool script, very useful!

When load_filings() gets called, the params are submitted and it performs the search.

I used a print(res.url) to verify.

I can't seem to pull table rows from the result page. I used another print statement and it returns an empty array.

[Screenshot: Screen Shot 2020-05-10 at 6 51 10 PM]

Do you have any suggestions?

Thanks again!
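
One way to narrow down an empty result like the one described above is to check whether the response body contains any table rows at all before suspecting the row selector. The sketch below is only a diagnostic aid under assumptions: RowCounter, count_rows, and the sample strings are hypothetical, and in practice the input would be the body of the response whose URL was printed (for example res.text, if res is a requests response).

```python
from html.parser import HTMLParser


class RowCounter(HTMLParser):
    """Count <tr> start tags in an HTML document."""

    def __init__(self):
        super().__init__()
        self.rows = 0

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows += 1


def count_rows(html):
    """Return how many table rows appear anywhere in the given HTML text."""
    counter = RowCounter()
    counter.feed(html)
    return counter.rows


# Hypothetical response bodies, for illustration only.
with_table = "<table><tr><td>Filing</td></tr><tr><td>Other</td></tr></table>"
without_table = "<html><body><p>Please accept the terms of use.</p></body></html>"

for label, body in [("with_table", with_table), ("without_table", without_table)]:
    n = count_rows(body)
    # Printing the start of the body helps show which page actually came back.
    print(label, "->", n, "rows; body starts with:", body[:60])
```

If the count is zero for the real response, the server may have returned a different page than the one seen in the browser, and the start of the body usually shows which. If the count is non-zero, the rows are present and the selector is the more likely culprit.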
