Skip to content

Commit

Permalink
Added datasets
Browse files Browse the repository at this point in the history
  • Loading branch information
kitsiosk committed Jul 5, 2021
1 parent ee599d5 commit 53f02a2
Show file tree
Hide file tree
Showing 14 changed files with 127,149 additions and 2,584 deletions.
1,036 changes: 165 additions & 871 deletions 5AbastractsEvaluation.ipynb → ...sGroupDataset/5AbastractsEvaluation.ipynb

Large diffs are not rendered by default.

Binary file not shown.
1,252 changes: 1,252 additions & 0 deletions 5AbstractsGroupDataset/Business.txt

Large diffs are not rendered by default.

1,468 changes: 1,468 additions & 0 deletions 5AbstractsGroupDataset/CSAI.txt

Large diffs are not rendered by default.

1,006 changes: 1,006 additions & 0 deletions 5AbstractsGroupDataset/Law.txt

Large diffs are not rendered by default.

1,072 changes: 1,072 additions & 0 deletions 5AbstractsGroupDataset/Sociology.txt

Large diffs are not rendered by default.

1,448 changes: 1,448 additions & 0 deletions 5AbstractsGroupDataset/Trans.txt

Large diffs are not rendered by default.

Binary file added AGNewsDataset/AGNewsData_sgtsnepi.mat
Binary file not shown.
1,001 changes: 170 additions & 831 deletions AGNewsEvaluation.ipynb → AGNewsDataset/AGNewsEvaluation.ipynb

Large diffs are not rendered by default.

120,001 changes: 120,001 additions & 0 deletions AGNewsDataset/train.csv

Large diffs are not rendered by default.

21 changes: 21 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
## *Text Classification Using Unsupervised Learning Techniques*

This code complements my diploma thesis on "Text Classification Using Unsupervised Learning Techniques" for the Electrical and Computer Engineering Department of Aristotle University of Thessaloniki, Greece.

Author: Kitsios Konstantinos

Correspondance address: [email protected]

Code sub-routines and pretrained neural networks are used from the research papers below:

[1]: D. Cer, Y. Yang, S. yi Kong, N. Hua, N. Limtiaco, R. S. John, N. Constant,M. Guajardo-Cespedes, S. Yuan, C. Tar, Y.-H. Sung, B. Strope, and R. Kurzweil, ‘‘Universal sentence encoder,’’arXiv preprint arXiv:1803.11175, 2018.

[2]: A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, ‘‘Language models areunsupervised multitask learners,’’, 2019.

[3]: N. Pitsianis, A. Iliopoulos, D. Floros, and X. Sun, ‘‘Spaceland embedding of sparse stochastic graphs,’’2019 IEEE High Performance Extreme Computing Conference (HPEC),pp. 1-8, 2019.

### Setup
The packages needed in order to run the code are in the *requirements.txt* file and you can install the through pip by running
`pip install requirements.txt`

The code for each dataset can be executed from the associated notebook (.ipynb)
Loading

0 comments on commit 53f02a2

Please sign in to comment.