Jupyter notebook – InforMedia Services (IMS)

Registration is now open for the workshop:

>>>>>>>>> https://ntmoore.github.io/2018-06-02-stcloud/ <<<<<<<<<<<<<

Syllabus:

The Unix Shell

Files and directories
History and tab completion
Pipes and redirection
Looping over files
Creating and running shell scripts
Finding things
Reference…

Programming in Python

Using libraries
Working with arrays
Reading and plotting data
Creating and using functions
Loops and conditionals
Defensive programming
Using Python from the command line
Reference…

https://en.wikipedia.org/wiki/GNU_nano

https://swcarpentry.github.io/shell-novice/03-create/

http://pad.software-carpentry.org/2018-06-02-stcloud

Jupyter is IDE https://en.wikipedia.org/wiki/Integrated_development_environment

https://searchcloudcomputing.techtarget.com/definition/Infrastructure-as-a-Service-IaaS

JSON file format where Jupiter data is stored. HMTL and Markdown (simplified HTML).

Panda: https://pandas.pydata.org/

React OS (JS) https://en.wikipedia.org/wiki/ReactOS

+++++++++++++++++++
more on Software Carpentry workshops on this iMS blog
https://blog.stcloudstate.edu/ims/2017/10/26/software-carpentry-workshop/

Reproducibility Librarian? Yes, That Should Be Your Next Job

BY EGLANTINE RONFARD

https://www.jove.com/blog/2017/10/27/reproducibility-librarian-yes-that-should-be-your-next-job/

Vicky Steeves (@VickySteeves) is the first Research Data Management and Reproducibility Librarian

Reproducibility is made so much more challenging because of computers, and the dominance of closed-source operating systems and analysis software researchers use. Ben Marwick wrote a great piece called ‘How computers broke science – and what we can do to fix it’ which details a bit of the problem. Basically, computational environments affect the outcome of analyses (Gronenschild et. al (2012) showed the same data and analyses gave different results between two versions of macOS), and are exceptionally hard to reproduce, especially when the license terms don’t allow it. Additionally, programs encode data incorrectly and studies make erroneous conclusions, e.g. Microsoft Excel encodes genes as dates, which affects 1/5 of published data in leading genome journals.

technology to capture computational environments, workflow, provenance, data, and code are hugely impactful for reproducibility. It’s been the focus of my work, in supporting an open source tool called ReproZip, which packages all computational dependencies, data, and applications in a single distributable package that other can reproduce across different systems. There are other tools that fix parts of this problem: Kepler and VisTrails for workflow/provenance, Packrat for saving specific R packages at the time a script is run so updates to dependencies won’t break, Pex for generating executable Python environments, and o2r for executable papers (including data, text, and code in one).

a plugin for Jupyter notebooks), and added a user interface to make it friendlier to folks not comfortable on the command line.

I would also recommend going to conferences:

FORCE11 is a great conference for open access/source, data management, and reproducibility.
Research Data Access and Preservation (RDAP) is another great conference with more of a librarian focus.
Library and Information Technology Association (LITA) Forum is another librarian-focused meeting that usually has data management and reproducibility sessions.

++++++++++++++++++++++++
more on big data in an academic library in this IMS blog
academic library collection data visualization

https://blog.stcloudstate.edu/ims/2017/10/26/software-carpentry-workshop/

https://blog.stcloudstate.edu/ims?s=data+library

more on library positions in this IMS blog:
https://blog.stcloudstate.edu/ims?s=big+data+library
https://blog.stcloudstate.edu/ims/2016/06/14/technology-requirements-samples/

on university library future:
https://blog.stcloudstate.edu/ims/2014/12/10/unviersity-library-future/

InforMedia Services (IMS)

Tag: Jupyter notebook

Software Carpentry Workshop at SCSU Python