Skip to content

Commit

Permalink
msci. readme updates
Browse files Browse the repository at this point in the history
  • Loading branch information
Tim Molter committed Apr 13, 2016
1 parent 2c37c89 commit b4e8612
Show file tree
Hide file tree
Showing 10 changed files with 38 additions and 12 deletions.
2 changes: 1 addition & 1 deletion datasets-breast-cancer-wisconsin-orginal/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_BREAST_CANCER.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-census-income/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_CENSUS_INCOME.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-cifar10/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_CIFAR.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-hja-birdsong/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_HJA_BIRDSONG.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-lshtc4/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `LSHTC4.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-mnist/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_MNIST.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down
8 changes: 5 additions & 3 deletions datasets-nsl-kdd/README.md
Original file line number Diff line number Diff line change
@@ -1,19 +1,21 @@
## Knowm Datasets HJA Birdsong
## Knowm Datasets NSL-KDD

[raw data](http://nsl.cs.unb.ca/NSL-KDD/)

Note that the data is no longer available, but it can be accessed as described [here](https://stackoverflow.com/questions/31722843/unable-to-get-nsl-kdd-datasets).

## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_NSL_KDD.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

1. Download raw data from here: http://nsl.cs.unb.ca/NSL-KDD/
1. put all files (`KDDTrain+.txt` and `KDDTest+.txt`) in `raw` folder in project root
1. Run `RawData2DB.java`.
1. The database containing the data will appear in `db`.
1. Manually copy the `NSL_KDD.*` files into the `/usr/local/Datasets` folder. There should be three files.
1. Manually copy the `DB_NSL_KDD.*` files into the `/usr/local/Datasets` folder. There should be three files.

## Build jar containing Data

Expand Down
2 changes: 1 addition & 1 deletion datasets-numenta/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_NUMENTA.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to populate the DB with NAB Data

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -86,4 +86,28 @@ public static long selectMaxID() {
return Yank.queryScalar(SELECT_MAX, Long.class, null);
}

public static void updateName(String name, int pcbID, int pcbAnnotationID) {

Object[] params = new Object[] {

// @formatter:off
name,
pcbID,
pcbAnnotationID
// @formatter:on
};

String UPDATE = "UPDATE PCB_ANNOTATIONS SET name = ? WHERE PCBID = ? and ID = ?";

Yank.execute(UPDATE, params);
}

public static PCBAnnotation select(int pcbID, int id) {

Object[] params = new Object[] { pcbID, id };

String SELECT = "SELECT * FROM PCB_ANNOTATIONS WHERE pcbid = ? and id = ?";

return Yank.queryBean(SELECT, PCBAnnotation.class, params);
}
}
4 changes: 2 additions & 2 deletions datasets-ucsd-anomaly/README.md
Original file line number Diff line number Diff line change
@@ -1,11 +1,11 @@
## Knowm Datasets UCSD
## Knowm Datasets UCSD Anomaly Detection Dataset

[raw data](http://www.svcl.ucsd.edu/projects/anomaly/dataset.html)

## How to Get the Data

1. Manually download the database files that have been uploaded to Knowm's Google Drive account [here](https://drive.google.com/folderview?id=0ByP7_A9vXm17VXhuZzBrcnNubEE&usp=sharing#list).
1. Manually copy the `DB_PCB.*` files into the `/usr/local/Datasets` folder. There should be four files.
1. Manually copy the `DB_UCSD_ANOMALY.*` files into the `/usr/local/Datasets` folder. There should be four files.

## How to Generate Data

Expand Down

0 comments on commit b4e8612

Please sign in to comment.