mhp-p2p: frontend embedded mockups

a set of “self-contained” html pages, combined with custom design and code, used to prototype the final mockup for the frontend side of the platform2platform project.

process

webpage scraping
custom design
custom js to talk with mhp-p2p database

notes

scraping

refs:

eg

wget -mkEpnp --wait=5 <url>

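-m is recursive, but with -np (no parent) wget never ascends above the given url, so pointing it at an article downloads that single (sub-)page and its related assets rather than the whole site: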

wget -m -np -p https://amateurcities.com/no-happy-endings/

the wget command works for amateur-cities as well as online-open. open-set-reader, instead, is a javascript single page application: there’s just one index.html file and some javascript rendering everything, so wget alone does not capture the rendered pages.

to scrape this, a very basic but effective approach, if the number of pages is small (eg 5 in our case), is to combine wget with Firefox Save Page As > Web Page, Complete, and manually (or programmatically) merge the data from both sides.
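a minimal sketch of the programmatic side of that merge, assuming Node is available (the repo already has a package.json). the function name `mergeAssets` and the folder names in the usage note are hypothetical, not taken from the repo. it copies every file from the wget folder into the folder saved by Firefox and never overwrites a file Firefox already saved:

```javascript
// sketch only: mergeAssets and all folder names are illustrative assumptions
const fs = require("fs");
const path = require("path");

/**
 * Recursively copy files from srcDir (wget output) into destDir
 * (folder saved by Firefox). Files that already exist in destDir
 * are left untouched. Returns the relative paths that were copied.
 */
function mergeAssets(srcDir, destDir) {
  const copied = [];
  fs.mkdirSync(destDir, { recursive: true });
  for (const entry of fs.readdirSync(srcDir, { withFileTypes: true })) {
    const from = path.join(srcDir, entry.name);
    const to = path.join(destDir, entry.name);
    if (entry.isDirectory()) {
      // descend and keep paths relative to the top-level srcDir
      for (const rel of mergeAssets(from, to)) {
        copied.push(path.join(entry.name, rel));
      }
    } else if (entry.isFile() && !fs.existsSync(to)) {
      fs.copyFileSync(from, to);
      copied.push(entry.name);
    }
  }
  return copied;
}
```

for example, `mergeAssets("wget-output/some-page", "firefox-save/some-page_files")` would return the list of files that were added to the Firefox folder; both paths are placeholders.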

in practice, put the assets downloaded by wget inside the html folder saved by Firefox. some urls inside each index.html need to be updated so that they point to the merged files. in the case of open-set-reader, pages also lazy-load images. so again, rather than using puppeteer, an easy way is to scroll down the page to let all pictures download, while keeping the browser inspector open and set to the network tab; then get the origin url for each image, download it and put it in the assets/imgs folder.
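the last two steps (saving each image under assets/imgs and updating the urls in index.html) can also be scripted. this is only a sketch under assumptions: the function names are hypothetical, the origin urls are the ones copied by hand from the network tab, file names are assumed to be unique and free of percent-encoded characters, and `downloadImage` relies on the global `fetch` shipped with Node 18 or later.

```javascript
// sketch only: function names and the example urls are illustrative assumptions
const fs = require("fs");
const path = require("path");

/** Map an image's origin url to its local path, dropping any query string. */
function localImagePath(originUrl, imgDir = "assets/imgs") {
  const { pathname } = new URL(originUrl);
  return path.posix.join(imgDir, path.posix.basename(pathname));
}

/** Replace every occurrence of each origin url in the html with its local path. */
function rewriteImageUrls(html, originUrls, imgDir = "assets/imgs") {
  // longest urls first, so a url that is a prefix of another one
  // (eg the same image with and without a query string) is handled last
  const sorted = [...originUrls].sort((a, b) => b.length - a.length);
  let out = html;
  for (const url of sorted) {
    out = out.split(url).join(localImagePath(url, imgDir));
  }
  return out;
}

/** Download one image into imgDir (needs Node 18+ for the global fetch). */
async function downloadImage(originUrl, imgDir = "assets/imgs") {
  const res = await fetch(originUrl);
  if (!res.ok) {
    throw new Error(`failed to fetch ${originUrl}: ${res.status}`);
  }
  const target = localImagePath(originUrl, imgDir);
  await fs.promises.mkdir(path.dirname(target), { recursive: true });
  await fs.promises.writeFile(target, Buffer.from(await res.arrayBuffer()));
  return target;
}
```

a typical run would call `downloadImage` once per url collected from the network tab, then write the result of `rewriteImageUrls` back to index.html. urls that the page builds at runtime in javascript are not covered by this plain string replacement.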

Latest commit: aptoptout, "added index.html file that links to all articles", 9c20dbd, Feb 2, 2020 (30 commits)

Name	Last commit message	Last commit date
.idea	hardcoded plugin HTML section with accompanied css and js file under …	Jan 31, 2020
amateurcities.com	hardcoded plugin HTML section with accompanied css and js file under …	Jan 31, 2020
onlineopen.org	hardcoded plugin HTML section with accompanied css and js file under …	Jan 31, 2020
openset.nl	hardcoded plugin HTML section with accompanied css and js file under …	Jan 31, 2020
src	hardcoded plugin HTML section with accompanied css and js file under …	Jan 31, 2020
.gitignore	removed .idea folder created by Webstorm	Jan 31, 2020
index.html	added index.html file that links to all articles	Feb 2, 2020
main.js	add placeholder	Jan 24, 2020
package-lock.json	add slugify	Jan 24, 2020
package.json	add slugify	Jan 24, 2020
readme.md	add another commmand for `wget`	Jan 31, 2020

sonn-gamm/mhp-fem
