Downloading data:
cd crawler/data/raw
wget https://zenodo.org/records/7752615/files/unarXive_230324_open_subset.tar.xz?download=1
mkdir ./unarXive_230324_open_subset
tar -xf unarXive_230324_open_subset.tar.xz?download=1 -C ./unarXive_230324_open_subset
- Whatever is necessary
- Labeled sections
- Figures and tables are interpretable/ignorable
- Paper -> entities
- Essential entities
- Paper
- Task
- Method
- Results
- Dataset
- Models
- Libraries
- Non-essential entities
- Languages (natural)
- License
- Authors
- Affiliations
- Other information to collect
- Date
- Group/Merge entities
- Defining relationships between entities.
- Collection of facts about a given entity (of a specific type) -> Article
- Search
- Search bar
- Graph view
- Nodes
- Entity
- Type
- Edges
- Relationship
- Type
- Nodes
- Page preceding article generation
- See how many facts you have about a topic
- See relationships between entities
- Client side article generation with API Key
- Essential Article types
- Task
- Name
- Description
- Methods & Associated Results
- Method
- Task
- Non-essential Article types
- Paper
- Dataset
- Models
- Libraries
- Authors
- Affiliations
- Languages
- License
- Subscribe