Skip to content

Commit 3d3dcac

Browse files
Fixed chapter 1
1 parent 2889535 commit 3d3dcac

29 files changed

+336
-201
lines changed

book/01.Rmd

+76-77
Large diffs are not rendered by default.

book/02.Rmd

+3
Original file line numberDiff line numberDiff line change
@@ -420,6 +420,9 @@ Specifying the `--help` option also works for the GNU command-line tools such as
420420

421421
## Further Reading
422422

423+
* Heddings, Lowell. 2006. “Keyboard Shortcuts for Bash.” <a href="http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc" class="uri">http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc</a>.
424+
* Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.
425+
423426

424427

425428

book/03.Rmd

+2-1
Original file line numberDiff line numberDiff line change
@@ -388,6 +388,7 @@ For more popular APIs, there are specialized command-line tools available. These
388388

389389
## Further Reading
390390

391-
391+
* Molinaro, Anthony. 2005. <em>SQL Cookbook</em>. O’Reilly Media.
392+
* Wikipedia. 2014. “List of Http Status Codes.” <a href="http://en.wikipedia.org/wiki/List_of_HTTP_status_codes" class="uri">http://en.wikipedia.org/wiki/List_of_HTTP_status_codes</a>.
392393

393394

book/04.Rmd

+8-9
Original file line numberDiff line numberDiff line change
@@ -377,12 +377,11 @@ close(f)
377377

378378
## Further Reading
379379

380-
381-
382-
383-
384-
385-
386-
387-
388-
380+
* Docopt. 2014. “Command-Line Interface Description Language.” <a href="http://docopt.org" class="uri">http://docopt.org</a>.
381+
* Robbins, Arnold, and Nelson H. F. Beebe. 2005. <em>Classic Shell Scripting</em>. O’Reilly Media.
382+
* Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.
383+
* Perkins, Jacob. 2010. <em>Python Text Processing with Nltk 2.0 Cookbook</em>. Packt Publishing.
384+
* McKinney, Wes. 2012. <em>Python for Data Analysis</em>. O’Reilly Media.
385+
* Rossant, Cyrille. 2013. <em>Learning Ipython for Interactive Computing and Data Visualization</em>. Packt Publishing.
386+
* Wirzenius, Lars. 2013. “Writing Manual Pages.” <a href="http://liw.fi/manpages/" class="uri">http://liw.fi/manpages/</a>.
387+
* Raymond, Eric Steven. 2014. “Basics of the Unix Philosophy.” <a href="http://www.faqs.org/docs/artu/ch01s06.html" class="uri">http://www.faqs.org/docs/artu/ch01s06.html</a>.

book/05.Rmd

+4-1
Original file line numberDiff line numberDiff line change
@@ -1210,7 +1210,10 @@ $ csvsql --query 'SELECT i.sepal_length, i.sepal_width, i.species, m.usda_id '\
12101210

12111211
## Further Reading
12121212

1213-
1213+
* Molinaro, Anthony. 2005. <em>SQL Cookbook</em>. O’Reilly Media.
1214+
* Goyvaerts, Jan, and Steven Levithan. 2012. <em>Regular Expressions Cookbook</em>. 2nd Ed. O’Reilly Media.
1215+
* Dougherty, Dale, and Arnold Robbins. 1997. <em>Sed &amp; Awk</em>. 2nd Ed. O’Reilly Media.
1216+
12141217

12151218

12161219

book/06.Rmd

+1
Original file line numberDiff line numberDiff line change
@@ -326,5 +326,6 @@ We’ve only been able to scratch the surface with Drake. Some of its more advan
326326

327327
## Further Reading
328328

329+
* Factual. 2014. “Drake.” <a href="https://github.com/Factual/drake" class="uri">https://github.com/Factual/drake</a>.
329330

330331

book/07.Rmd

+4-4
Original file line numberDiff line numberDiff line change
@@ -793,7 +793,7 @@ Both `Rio` with `ggplot2` and `feedgnuplot` with Gnuplot have their advantages.
793793

794794
## Further Reading
795795

796-
797-
798-
799-
796+
* Wickham, Hadley. 2009. <em>ggplot2: Elegant Graphics for Data Analysis</em>. Springer.
797+
* Janert, Philipp K. 2009. <em>Gnuplot in Action</em>. Manning Publications.
798+
* Tukey, John W. 1977. <em>Exploratory Data Analysis</em>. Pearson.
799+

book/08.Rmd

+4
Original file line numberDiff line numberDiff line change
@@ -766,6 +766,10 @@ Once you have a basic understanding of GNU Parallel and its most important optio
766766

767767
## Further Reading
768768

769+
* Tange, O. 2011. “GNU Parallel - the Command-Line Power Tool.”<em>;Login: The USENIX Magazine</em> 36 (1). Frederiksberg, Denmark:42–47. <a href="http://www.gnu.org/s/parallel" class="uri">http://www.gnu.org/s/parallel</a>.
770+
* Tange, Ole. 2014. “GNU Parallel.” <a href="http://www.gnu.org/software/parallel" class="uri">http://www.gnu.org/software/parallel</a>.
771+
* Services, Amazon Web. 2014. “AWS Command Line Interface.” <a href="http://aws.amazon.com/cli" class="uri">http://aws.amazon.com/cli</a>.
772+
769773

770774

771775

book/09.Rmd

+4-1
Original file line numberDiff line numberDiff line change
@@ -740,7 +740,10 @@ Although we’ve only been able to experiment with one prediction API, we do bel
740740

741741
## Further Reading
742742

743-
743+
* Cortez, P., A. Cerdeira, F. Almeida, T. Matos, and J. Reis. 2009. “Modeling Wine Preferences by Data Mining from Physicochemical Properties.” <em>Decision Support Systems</em> 47 (4). Elsevier:547–53.
744+
* Hall, Mark, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. “The WEKA Data Mining Software: An Update.” <em>SIGKDD Explorations</em> 11 (1). ACM.
745+
* Pearson, K. 1901. “On Lines and Planes of Closest Fit to Systems of Points in Space.” <em>Philosophical Magazine</em> 2 (11):559–72.
746+
* Maaten, Laurens van der, and Geoffrey Everest Hinton. 2008. “Visualizing Data Using T-SNE.” <em>Journal of Machine Learning Research</em> 9:2579–2605.
744747

745748

746749

book/10.Rmd

+12-4
Original file line numberDiff line numberDiff line change
@@ -50,22 +50,30 @@ As this book is on the intersection of the command line and data science, many r
5050

5151
### APIs
5252

53+
* Russell, Matthew. 2013. <em>Mining the Social Web</em>. 2nd Ed. O’Reilly Media.
54+
* Warden, Pete. 2011. <em>Data Source Handbook</em>. O’Reilly Media.
5355

5456

5557
### Shell Programming
5658

57-
58-
59-
59+
* Winterbottom, David. 2014. “Commandlinefu.com.” <a href="http://www.commandlinefu.com" class="uri">http://www.commandlinefu.com</a>.
60+
* Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.
61+
* Goyvaerts, Jan, and Steven Levithan. 2012. <em>Regular Expressions Cookbook</em>. 2nd Ed. O’Reilly Media.
62+
* Cooper, Mendel. 2014. “Advanced Bash-Scripting Guide.” <a href="http://www.tldp.org/LDP/abs/html" class="uri">http://www.tldp.org/LDP/abs/html</a>.
63+
* Robbins, Arnold, and Nelson H. F. Beebe. 2005. <em>Classic Shell Scripting</em>. O’Reilly Media.
6064

6165

6266
### Python, R, and SQL
6367

64-
68+
* Wickham, Hadley. 2009. <em>ggplot2: Elegant Graphics for Data Analysis</em>. Springer.
69+
* McKinney, Wes. 2012. <em>Python for Data Analysis</em>. O’Reilly Media.
70+
* Rossant, Cyrille. 2013. <em>Learning Ipython for Interactive Computing and Data Visualization</em>. Packt Publishing.
6571

6672

6773
### Interpreting Data
6874

75+
* Shron, Max. 2014. <em>Thinking with Data</em>. O’Reilly Media.
76+
* Patil, DJ. 2012. <em>Data Jujitsu</em>. O’Reilly Media.
6977

7078

7179
## Getting in Touch

book/_output.yml

+1
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@ bookdown::gitbook:
22
css: style.css
33
split_bib: no
44
keep_md: yes
5+
citation_package: natbib
56
includes:
67
in_header: [google_analytics.html]
78
config:

book/index.Rmd

+2
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,8 @@ site: bookdown::bookdown_site
66
documentclass: book
77
bibliography: [library.bib, tools.bib]
88
biblio-style: apalike
9+
nocite: |
10+
@Schutt2013, @Peek2002, @Heddings2006, @Molinaro2005, @HTTP, @docopt, @Rossant2013, @manpages, @Raymond2003, @Goyvaerts2012, @Dougherty1997, @Tange2011a, @Cortez2009, @commandlinefu, @Cooper2014, @Russell2013, @Warden2011
911
link-citations: yes
1012
description: "This is the website for Data Science at the Command Line, published by O'Reilly October 2014 First Edition. This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data."
1113
---

book/library.bib

+2-1
Original file line numberDiff line numberDiff line change
@@ -164,7 +164,7 @@ @book{Tukey1977
164164
165165
@webpage{AWS2014,
166166
title = {AWS Command Line Interface Documentation},
167-
author = {{Amazon Web Services}},
167+
author = {Amazon&nbsp;Web&nbsp;Services},
168168
url = {http://aws.amazon.com/documentation/cli/},
169169
year = {2014}
170170
}
@@ -241,6 +241,7 @@ @book{Goyvaerts2012
241241

242242
@book{Dougherty1997,
243243
author = {Dale Dougherty and Arnold Robbins},
244+
title = {sed \& awk},
244245
edition = {2nd Ed},
245246
publisher = {O'Reilly Media},
246247
year = {1997},

book/style.css

+5-3
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
.rmdcaution, .rmdimportant, .rmdnote, .rmdtip, .rmdwarning {
1+
.rmdcaution, .rmdimportant, .rmdnote, .rmdtip, .rmdwarning, .rmdcomment {
22
padding: 1em 1em 1em 5em;
33
margin-bottom: 2em;
44
background: #f5f5f5 1em center/3em no-repeat;
@@ -8,7 +8,7 @@
88
background-image: url("images/exclamation-triangle.svg");
99
}
1010
.rmdimportant {
11-
background-image: url("images/exclamation-triangle.svg");
11+
background-image: url("images/comment.svg");
1212
}
1313
.rmdnote {
1414
background-image: url("images/info-circle.svg");
@@ -19,7 +19,9 @@
1919
.rmdwarning {
2020
background-image: url("images/comment.svg");
2121
}
22-
22+
.rmdcomment {
23+
background-image: url("images/comment.svg");
24+
}
2325

2426
p.caption {
2527
color: #777;

docs/conclusion-1.html

+20-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@
2525
<meta name="author" content="Jeroen Janssens">
2626

2727

28-
<meta name="date" content="2017-12-27">
2928

3029
<meta name="viewport" content="width=device-width, initial-scale=1">
3130
<meta name="apple-mobile-web-app-capable" content="yes">
@@ -364,15 +363,35 @@ <h2><span class="header-section-number">10.3</span> Where To Go From Here?</h2>
364363
<p>As this book is on the intersection of the command line and data science, many related topics have only been touched upon. Now, it’s up to you to further explore these topics. Below is a list of topics and suggested resources to consult.</p>
365364
<div id="apis" class="section level3">
366365
<h3><span class="header-section-number">10.3.1</span> APIs</h3>
366+
<ul>
367+
<li>Russell, Matthew. 2013. <em>Mining the Social Web</em>. 2nd Ed. O’Reilly Media.</li>
368+
<li>Warden, Pete. 2011. <em>Data Source Handbook</em>. O’Reilly Media.</li>
369+
</ul>
367370
</div>
368371
<div id="shell-programming" class="section level3">
369372
<h3><span class="header-section-number">10.3.2</span> Shell Programming</h3>
373+
<ul>
374+
<li>Winterbottom, David. 2014. “Commandlinefu.com.” <a href="http://www.commandlinefu.com" class="uri">http://www.commandlinefu.com</a>.</li>
375+
<li>Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.</li>
376+
<li>Goyvaerts, Jan, and Steven Levithan. 2012. <em>Regular Expressions Cookbook</em>. 2nd Ed. O’Reilly Media.</li>
377+
<li>Cooper, Mendel. 2014. “Advanced Bash-Scripting Guide.” <a href="http://www.tldp.org/LDP/abs/html" class="uri">http://www.tldp.org/LDP/abs/html</a>.</li>
378+
<li>Robbins, Arnold, and Nelson H. F. Beebe. 2005. <em>Classic Shell Scripting</em>. O’Reilly Media.</li>
379+
</ul>
370380
</div>
371381
<div id="python-r-and-sql" class="section level3">
372382
<h3><span class="header-section-number">10.3.3</span> Python, R, and SQL</h3>
383+
<ul>
384+
<li>Wickham, Hadley. 2009. <em>ggplot2: Elegant Graphics for Data Analysis</em>. Springer.</li>
385+
<li>McKinney, Wes. 2012. <em>Python for Data Analysis</em>. O’Reilly Media.</li>
386+
<li>Rossant, Cyrille. 2013. <em>Learning Ipython for Interactive Computing and Data Visualization</em>. Packt Publishing.</li>
387+
</ul>
373388
</div>
374389
<div id="interpreting-data-1" class="section level3">
375390
<h3><span class="header-section-number">10.3.4</span> Interpreting Data</h3>
391+
<ul>
392+
<li>Shron, Max. 2014. <em>Thinking with Data</em>. O’Reilly Media.</li>
393+
<li>Patil, DJ. 2012. <em>Data Jujitsu</em>. O’Reilly Media.</li>
394+
</ul>
376395
</div>
377396
</div>
378397
<div id="getting-in-touch" class="section level2">

docs/creating-reusable-command-line-tools.html

+10-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@
2525
<meta name="author" content="Jeroen Janssens">
2626

2727

28-
<meta name="date" content="2017-12-27">
2928

3029
<meta name="viewport" content="width=device-width, initial-scale=1">
3130
<meta name="apple-mobile-web-app-capable" content="yes">
@@ -629,6 +628,16 @@ <h3><span class="header-section-number">4.3.2</span> Processing Streaming Data f
629628
</div>
630629
<div id="further-reading-3" class="section level2">
631630
<h2><span class="header-section-number">4.4</span> Further Reading</h2>
631+
<ul>
632+
<li>Docopt. 2014. “Command-Line Interface Description Language.” <a href="http://docopt.org" class="uri">http://docopt.org</a>.</li>
633+
<li>Robbins, Arnold, and Nelson H. F. Beebe. 2005. <em>Classic Shell Scripting</em>. O’Reilly Media.</li>
634+
<li>Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.</li>
635+
<li>Perkins, Jacob. 2010. <em>Python Text Processing with Nltk 2.0 Cookbook</em>. Packt Publishing.</li>
636+
<li>McKinney, Wes. 2012. <em>Python for Data Analysis</em>. O’Reilly Media.</li>
637+
<li>Rossant, Cyrille. 2013. <em>Learning Ipython for Interactive Computing and Data Visualization</em>. Packt Publishing.</li>
638+
<li>Wirzenius, Lars. 2013. “Writing Manual Pages.” <a href="http://liw.fi/manpages/" class="uri">http://liw.fi/manpages/</a>.</li>
639+
<li>Raymond, Eric Steven. 2014. “Basics of the Unix Philosophy.” <a href="http://www.faqs.org/docs/artu/ch01s06.html" class="uri">http://www.faqs.org/docs/artu/ch01s06.html</a>.</li>
640+
</ul>
632641

633642
</div>
634643
</div>

docs/exploring-data-1.html

+5-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@
2525
<meta name="author" content="Jeroen Janssens">
2626

2727

28-
<meta name="date" content="2017-12-27">
2928

3029
<meta name="viewport" content="width=device-width, initial-scale=1">
3130
<meta name="apple-mobile-web-app-capable" content="yes">
@@ -1019,6 +1018,11 @@ <h3><span class="header-section-number">7.4.9</span> Summary</h3>
10191018
</div>
10201019
<div id="further-reading-6" class="section level2">
10211020
<h2><span class="header-section-number">7.5</span> Further Reading</h2>
1021+
<ul>
1022+
<li>Wickham, Hadley. 2009. <em>ggplot2: Elegant Graphics for Data Analysis</em>. Springer.</li>
1023+
<li>Janert, Philipp K. 2009. <em>Gnuplot in Action</em>. Manning Publications.</li>
1024+
<li>Tukey, John W. 1977. <em>Exploratory Data Analysis</em>. Pearson.</li>
1025+
</ul>
10221026

10231027
</div>
10241028
</div>

docs/getting-started.html

+4-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@
2525
<meta name="author" content="Jeroen Janssens">
2626

2727

28-
<meta name="date" content="2017-12-27">
2928

3029
<meta name="viewport" content="width=device-width, initial-scale=1">
3130
<meta name="apple-mobile-web-app-capable" content="yes">
@@ -613,6 +612,10 @@ <h3><span class="header-section-number">2.3.7</span> Help!</h3>
613612
</div>
614613
<div id="further-reading-1" class="section level2">
615614
<h2><span class="header-section-number">2.4</span> Further Reading</h2>
615+
<ul>
616+
<li>Heddings, Lowell. 2006. “Keyboard Shortcuts for Bash.” <a href="http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc" class="uri">http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc</a>.</li>
617+
<li>Peek, Jerry, Shelley Powers, Tim O’Reilly, and Mike Loukides. 2002. <em>Unix Power Tools</em>. 3rd Ed. O’Reilly Media.</li>
618+
</ul>
616619

617620
</div>
618621
</div>

docs/index.html

+1-2
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,6 @@
2525
<meta name="author" content="Jeroen Janssens">
2626

2727

28-
<meta name="date" content="2017-12-27">
2928

3029
<meta name="viewport" content="width=device-width, initial-scale=1">
3130
<meta name="apple-mobile-web-app-capable" content="yes">
@@ -328,7 +327,7 @@ <h1>
328327
<div id="header">
329328
<h1 class="title">Data Science at the Command Line</h1>
330329
<h4 class="author"><em>Jeroen Janssens</em></h4>
331-
<h4 class="date"><em>December 27, 2017</em></h4>
330+
<h4 class="date"><em>January 3, 2018</em></h4>
332331
</div>
333332
<div id="welcome" class="section level1 unnumbered">
334333
<h1>Welcome</h1>

0 commit comments

Comments
 (0)