hyperspider

hyperspider is a declarative HATEOAS API crawler for node.js. Give it a list of url patterns, and it will recursively crawl your hypertext HTTP API, streaming back every matching endpoint.

hyperspider is great for folks who want to create clean, granular, and self-documenting hypertext APIs, but also want to avoid the latency of remotely fetching hundreds of tiny HTTP resources.

Example

Let's say you had a Twitter clone with a hypertext JSON API, including the following two sample endpoints:

`GET /users/:id` (fetch a user)

{
  "href": "/users/jedschmidt",
  "name": "Jed Schmidt",

  "location":  { "href": "/japan/tokyo" },
  "updates":   { "href": "/users/jedschmidt/updates" },
  "following": { "href": "/users/jedschmidt/following" },
  "followers": { "href": "/users/jedschmidt/followers" }
}

`GET /users/:id/following` (fetch the users a user is following)

{
  "href": "/users/jedschmidt/following",
  "items": [
    { "href": "/users/janl" },
    { "href": "/users/cramforce" },
    { "href": "/users/hblank" },
    { "href": "/users/theophani" }
  ]
}

To create a single following-detailed resource so that API consumers don't have to make a separate HTTP call for each resource, use hyperspider like this:

var hyperspider = require("hyperspider")

var options = {
  host: "mytwitterclone.biz",
  path: [
    "/users/jedschmidt",
    "/users/jedschmidt/following",
    "/users/*"
  ]
}

hyperspider(options, function(err, data) {
  // data is an array with the result entities of 6 endpoints:
  // 1. /users/jedschmidt
  // 2. /users/jedschmidt/following
  // 3. /users/janl
  // 4. /users/cramforce
  // 5. /users/hblank
  // 6. /users/theophani
})

See the tests for a working example.

Installation

Use npm to install hyperspider:

npm i hyperspider

API

req = hyperspider(options, [callback])

options: An object containing the same options as for node's http.request method, with one exception: path can be any of the following:

a normal url path, such as /users
a function that takes a path and returns a boolean
a RegExp that matches a path
a wildcard url path to turn into a RegExp, with * replaced by [^/]+ and ** replaced by .*?
an array containing any of the above

Note that path must contain at least one normal url path to serve as the starting point for the crawl.
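
The path rules above can be sketched roughly as follows. This is only an illustration of the described behavior, not hyperspider's actual source: the helper names wildcardToRegExp and toMatcher are hypothetical, and anchoring the generated RegExp with ^ and $ is an assumption.

```javascript
// Hypothetical helper (not part of hyperspider's API): turn a wildcard
// path into a RegExp, with * replaced by [^/]+ and ** replaced by .*?
// Anchoring with ^ and $ is an assumption made for this sketch.
function wildcardToRegExp(pattern) {
  var source = pattern
    // Escape RegExp metacharacters other than "*".
    .replace(/[.+?^${}()|[\]\\]/g, "\\$&")
    // Swap both wildcards in one pass so "**" is never read as two "*".
    .replace(/\*\*|\*/g, function (star) {
      return star === "**" ? ".*?" : "[^/]+"
    })

  return new RegExp("^" + source + "$")
}

// Hypothetical helper: normalize any accepted path value into a
// function that takes a path and returns a boolean.
function toMatcher(path) {
  if (Array.isArray(path)) {
    var matchers = path.map(toMatcher)
    return function (candidate) {
      return matchers.some(function (matches) { return matches(candidate) })
    }
  }

  if (typeof path === "function") return path

  if (path instanceof RegExp) {
    return function (candidate) { return path.test(candidate) }
  }

  if (path.indexOf("*") !== -1) {
    var re = wildcardToRegExp(path)
    return function (candidate) { return re.test(candidate) }
  }

  // A normal url path only matches itself.
  return function (candidate) { return candidate === path }
}

var matches = toMatcher([
  "/users/jedschmidt",
  "/users/jedschmidt/following",
  "/users/*"
])

console.log(matches("/users/janl"))                 // true
console.log(matches("/users/jedschmidt/following")) // true
console.log(matches("/japan/tokyo"))                // false
```

Under these assumptions, "/users/*" matches a single path segment such as /users/janl but not /users/jedschmidt/followers, while "/users/**" would match both.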

callback(err, data): Buffers the event stream into a single callback. err is null if no errors occurred, and otherwise an array of errors. data is an array of string entities, one for each successful HTTP call. Omit this to listen for a stream of events.

req.extract (or hyperspider.prototype.extract)

This method takes a single argument, an HTTP response body string. Override it before adding listeners to customize how hyperspider extracts urls from each resource. By default, the response body is parsed as JSON and the value of every nested href property is extracted, but you could roll your own, for example to parse Link headers.
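
As a rough sketch of the default behavior described above, the function below collects the value of every nested href property from a JSON body. The name extractHrefs is hypothetical, and returning a plain array of strings is an assumption; check hyperspider.js for what extract is actually expected to return before overriding it.

```javascript
// Hypothetical stand-in for the default extraction step: parse the body
// as JSON and collect the value of every nested "href" property.
// Returning an array of strings is an assumption made for this sketch.
function extractHrefs(body) {
  var hrefs = []

  // JSON.parse calls the reviver for every key/value pair,
  // including those in nested objects and arrays.
  JSON.parse(body, function (key, value) {
    if (key === "href" && typeof value === "string") hrefs.push(value)
    return value
  })

  return hrefs
}

var body = JSON.stringify({
  href: "/users/jedschmidt/following",
  items: [
    { href: "/users/janl" },
    { href: "/users/cramforce" }
  ]
})

console.log(extractHrefs(body))
// [ '/users/jedschmidt/following', '/users/janl', '/users/cramforce' ]
```

A custom extractor with the same shape could then be assigned to req.extract, or to hyperspider.prototype.extract to change it for every crawl.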
