Jupyter Book Open Source Development

Jupyter Book Open Source Development

March 20, 2020
9:00 am

About

Jupyter Book lets you build an online book using a collection of Jupyter Notebooks and Markdown files. Its output is similar to the excellent Bookdown tool, and adds extra functionality for people running a Jupyter stack.

Transcript

[Music] hello the internet welcome to open source directions hosted by open teams

the webinar that brings you all the news about the future of your favorite open-source projects my name is Henry

battery and I’m the growth marketer at open teams and I’ll be your host open source directions co-hosting with me

today eight years hi I’m MIDI in and I’m excited to call this episode of open source directions

I’m a postdoc at the University of Illinois at urbana-champaign I work on a project called YT and I’m based in

Urbana Illinois and I’m Chris I work at University of California at Berkeley and

I primarily focus on the Jupiter project and a related project called binder thank you very much so before we go into

the meat of this episode we have our famous tweet of the week section where each of our panelists presents a tweet

that they’ve been enjoying recently so Chris you’re up first yeah so the tweet that I found probably

unsurprising is related to the recent kind of global concerns around that corona virus and I noticed that a fellow

Syfy PI data contributor guy ala bucco and several of his collaborators as well

recently shared a link to a little web app that they built using I think plotly and scikit-learn and a couple of

different pieces of the Syfy ecosystem the visualize is the latest data about

what’s going on in the world with the the kovat outbreak and you can include some projections and some modeling

efforts and things like that so that I was kind of a cool way of seeing the Syfy community coming together and giving their treatment to this awesome

and if you want to find this other link to the tweets will be in the chat and making your op Nick’s okay so mine is

related to animals because we all need a little bit of joy in our lives right now

so maybe some of you have seen the Shedd Aquarium has let their penguins walk

around because the aquarium is closed so they’re letting them like walk around in the general area around the aquarium and

so this particular tweet is a video of a bunch of penguins hopping down stairs like they’re going from one floor to

another and they don’t walk downstairs there’s like this very adorable plopping noise as they come down so please enjoy

awesome so yeah my tweet of the week which will be posted in the chat so essentially open-source ventilator

Ireland there’s this movement towards creating a ventilator design and then

trying to figure out how we can produce that at a very very low cost given the corona virus and the pandemic that’s

actually happening at the moment I so I thought this much weight of the week because I I love the idea that everyone comes together and the open source

really the open source ethos is probably one way that I think we can all collectively come together remotely and

work towards really solving this problem and we all know I just watched a video that in New York they’re calling for

anyone who’s basically got a ventilator but donated and helped people that because it’s going to be it’s in high

demand and it’s very little supply considering everything that’s happening so so much share that all right well now

that we’re done with the tweet of the week let’s dive a little bit deeper in view for the book so derivative work is

an open source project that lets you build an online book using the collection of Jupiter notebooks and

markdown files it’s a simple good book down tool and it adds an extra

functionality for people running at Jupiter stack so you can find jupiter book on github at github comm slash

jupiter slash jupiter – book it has around 669 stars on github and about

1932 downloads a month on pipe I so so Chris can you tell us now that we have

this nice introduction to Jupiter book what was this president why was this project started and what need does it

feel yeah sure so originally Jupiter book was started at UC Berkeley and we

needed it in order to provide open notebook based on learning content for some of the larger classes that we have

at the University so in the last couple of years Berkeley has revamped a lot of its pedagogy around data science and

data analytics and particularly around introductory data sign and data analytics we wanted to use

largely an open source stack that is built on top of Jupiter infrastructure

and the kind of broader sigh pi PI data community and we had a lot of content that was written in notebooks but wanted

a way to present it to students in a way that felt a little bit more familiar a little bit more like a traditional book

that they could sort of quickly glanced through and browse so for this reason we created Jupiter book as a way to very

quickly convert a collection of our notebooks into static based HTML site that we could just send a link to

students and then they could quickly look through the content and then you know grab a notebook if they want it to

really dive deeper into what was inside yes so it sounds like you’ve sort of built it and Venus bye-bye other

technologies which was very prevalent in his space so could you explain a little bit of the history of the name and logo

of the project yeah so I mean I think that essentially we were trying to think

of what would be a good representation of a collection of Jupiter notebooks

presented in a more traditional form and so it seemed like a book was a good way

of thinking about that so if you think about each notebook as an individual page then the collection of notebooks is

a collection of pages and it’s perfect

to make puns about so yeah very funny

yeah so a lot of really great projects out there that aim to help you kind of

stitch together collections of content files and then show them as like a holistic unit probably the most

well-known one is a product that comes from the our studio community called book down and this is a really excellent

tool in the our community in many ways it’s kind of in my opinion one of the gold standards of creating like

computational books and it definitely inspired a lot of the creation design ideas around Jupiter book which tries to

serve a similar kind of role before the Jupiter community and the king you know building around the Jupiter stack

they’re also a couple of really interesting services that people run for hosting collections of notebooks probably the earliest one and the

Jupiter world is called and be fewer which is a service that the Jupiter community’s been running for

probably like six or seven years and it just likes you really quickly render a notebook it’s hosted on github and then

browse it as like a static HTML page and then or a recent one that I just recently learned about came out of the

FAFSA a I project and Jeremy Howard they really they recently launched a really

interesting project I think it’s called fast pages that uses github pages and

you have actions to quickly allow you to generate a static blog using Jupiter

notebooks on github and I’m looking forward to checking that one out more – okay that’s awesome yeah thank you for

sharing that and one thing that I’ve noticed last week with Pi janitor they have were inspired by the our community

in some of your day field and it seems like you are also inspired by something the our community appeal to so it’s

interesting to see that pattern but on the topic of that so what technology is

it built on your project right so there are two different answers to this

question and I can get into more details about the details there later on but the

the short version is that the current version of Jupiter book is largely a combination of two things one of them is

a Python API and a little lightweight Python library that uses a project

called env convert which can convert Jupiter notebooks into a collection of other kinds of output formats in this

case HTML and a project called Jekyll which is a ruby based static website

generator but allows you to stitch a collection of HTML pages into a static website and the current version of

Jupiter book basically just wraps those two pieces and then gives people a command line interface that they can use

to quickly generate books from their content however it’s I say it’s a

complicated answer because we recently received a grant to work on the Jupiter book project and a few related tools

from the salon foundation and we’re in the middle of rewriting most of the backend of how the Jupiter book project

uses the underlying infrastructure and in particular we’re gonna try to utilize Sphinx which is another Python project

and documentation and stack for Python and we think that that’s going to both allow

us to have more high quality content and include things like citations and equations and cross-references to tuck

it into later as a lot of standardized our toolkit along some of the other standard tools in the in the Python you

could system can you tell us who started

it you kind of hinted at this a little bit but I’m very curious yeah so I think that well it’s at the Henry’s points the

inspiration for a Jupiter book has come from lots of different kinds of sources so in a sense you know it was started by

a collection of open source projects most specifically at UC Berkeley there’s

always been kind of a Jupiter team at UC Berkeley that’s been part of the Jupiter kidney to be more broadly and we helped

the university develop tools deploy those tools maintain those tools for some of the pedagogical and research

news cases at the University and so we had initially developed Jupiter book to serve some of the larger data science

courses that we have that’s great to hear yeah so exporter minded listeners

to that right now we’re talking about Jupiter book and in the roadmap discussion we’re going to go more into

detail about where you can keep contribute and how you can contribute but if you are hearing things that are

interesting you and if you do have any questions then please post the questions in the right-hand side of the webinar

screen and we’ll get to your questions in the Q&A section at the end of the episode so please ask anything that

comes to mind anything is interested interesting you or anything related to Jupiter book just make sure you list

some questions so you talked about yes so that we’ve we know who started it that’s also that’s great to hear that

they came out of the university and and you’ve really been driving this could you tell us who maintains the project

yeah so I think that the most hands-on maintainer over the last say year and a

half or so which is about the amount of time that project has been around has been be as a part of the Jupiter team at

Berkeley we also got a lot of really good help early on from Sam Lau who was a student at UC Berkeley a year ago and

Elizabeth you pray who’s up in Montreal and it’s also contributed quite a bit it

has now like a broader kind of a relatively small but more diverse community of contributors that just help

out here and there and in the future as I kind of alluded to earlier we recently

got a grant then it’s going to partially join together a couple of different python-based book projects that

hopefully also grow the contributor base of which bitter book project and the process cool I’m so can you I’m curious

like you’ve talked a little bit about the origins and your book kind of starting in academia and some of the

collaborators you talked about also sound like their academics can you tell us what communities the users and the

contributors are from yeah so I think that originally it was you know people who are teaching classes and that’s

certainly still a really common user archetype that I see easy Jupiter book

but really I mean to provoke is a it’s a communication tool and it turns out that

there are lots of different use cases where people look wants to just share collections of notebooks in a way that

looks like an online kind of interactive book and so a lot of people that do you

know the run data science projects and want to just share the results with people and a little bit more of a familiar form people that want to bundle

notebooks as a part of like a scientific paper or research analysis have used

Jupiter book as well and know a few different publishing pipelines or looking into using it as a part of their

publishing tool chains so I think that that that’s becoming a more diverse group and just really quickly one other

thing I forgot to mention so on the Jupiter book repository on github github has these dependency chains that it

builds so it can tell you like what other projects depend on a given project and actually if you click on that you’ll

you’ll basically get a list of all of the different Jupiter books that people are hosting on github I think it’s

something like six or seven hundred different books and it’s a good way to just get some inspiration for what are the kinds of things the people are using

the tool that’s also yes sounds like you really do it out of community who’s really taking this and grabs it and

taking ownership and sounds like the contributor base to is a lot recently so sort of leads into my

next question has the project’s been participating in any diversity and inclusion efforts yeah so I think that

Jupiter book as a sub project within Jupiter more broadly has certainly been

making a lot of inroads into this Jupiter itself is like a really large open and attempts to be inclusive

community and it’s done a lot of different work to try and improve these these pieces of its community over the

past couple of years it’s always a work in progress and there’s definitely still a lot more that the Jupiter community

can and will be doing but we try to you know for example have all of our meetings open to the general public and

you know meeting minutes posted online so those can make the timezones they’re there a lot of the teams that I work

with we alternate our team the schedule of our meetings to be more

aysia friendly or America friendly or Europe friendly we also have a few kind

of specific efforts in diversity and inclusion so we recently applied for one of the czi essential open source science

grants the eos s grants which we luckily were awarded one that was with jupiter

hub and the binder projects which are kind of you know related to although not the exact same thing as jupiter book and

one of the main goals of that is to be able to pay for time for people to work on the project and we’re gonna

prioritize that time for people that probably wouldn’t be able to contribute otherwise so right now that’s gonna pay

for a woman in Romania in georgeanna to do some work on the littlest Jupiter hub

and the binder project as well that’s great to hear yeah it’s great to see that it seems like you’re really participating and um you know and you

have it having goals and at least a roadmap to be able to expand that and make the contributor base more go-to

first which is great to hear so now we’re going to shift into the project demo where we’ll get to see some of the

cool features of Jupiter book and how it works so while Chris is getting set up we would like to take this opportunity

to thank our sponsor icon site for sponsoring this episode of open source directions Quan site creating value from

data so you ready Chris take it away okay can you all see my screen yeah yeah okay I’m

gonna work from a couple of slides and then I’m also going to hopefully pop out and do a few demos and we’ll see how

that goes okay I don’t really need to do my own introductions anymore but really

quickly I want to point out some of the faces down at the bottom so in the bottom left is Elizabeth Duprey and sam

lau and as i mentioned both of them did a lot of work on the earlier iterations of jupiter book and on the current

version of jupiter book and on the bottom right that’s John Sturges ski and Greg pepper also PIAA nu and Australia

and a PI at Northern Arizona University in the States respectively and they are sort of key contributors on the next

version of Jupiter but I’ll talk about a little bit and so really quickly just to

motivate the reason that this existed in the first place was the general

inspiration for how Berkeley has been approaching its courses at the University which is that they want their data science education to be for

everybody at the University and by that I mean students who aren’t going to be you know computer science majors and

sophistic majors but also for students who are going to be you know philosophy majors and English literature majors and

Poli Sci majors obviously they don’t need all of them to take like a full four years worth of statistically

rigorous courses but they want all of them to be familiar with the concepts of data analytics and statistical thinking

and whatnot the challenge is that at Berkeley you know it’s a large public

university and if you say oh we’re gonna offer like a data science course for literally anyone who wants to come without any background and you know

coding or or statistics you get classrooms like this so this is the largest well it’s not even really a

classroom this is the Performing Arts Hall at UC Berkeley and it’s filled to capacity with students for their

introductory gain an eighth course which is a course that utilizes the Jupiter

stack very very heavily and so a lot of what we have to do is try and build technology that can facilitate the

learning for this very very large and very very diverse and heterogeneous student population some of whom are

familiar with coding and Python and whatnot but many of whom are not and so I’m really the goal of the course

is to teach concepts and give them some experience with interactive computing

using an entirely open stack and also to partner with other open projects and

open communities in doing so but what we didn’t have the ability to do was to

display a kind of notebook contents and a more familiar or accessible form particularly for the students that

weren’t already comfortable with like spinning up their own Jupiter notebook server and whatnot and the other thing

that we wanted to do is try to strike a balance between like full interactivity

with a live Python kernel that you would get and like a jupiter notebook or jupiter lab versus like a completely

static website that you would get if you were just you know rendering some some HTML and like a traditional blogging

platform and so really jupiter book is trying to strike that balance between the two of them and give the author the

ability to choose how much interactivity they want to bring into the reading experience okay so that brings me to

jupiter book and just to repeat the tagline for the tenth time the goal of

jupiter book which will allow you to online book with jupiter books and with markdown that down there that link

that’ll send you to the current version of jupiter book it looks like this imma

be the easiest way to describe jupiter book is just to show you what builds a Jupiter book looks like and this is

actually a page I can probably just go to on my own so if you go to inferential thinking com you’ll actually see the

generated book for that dating aid class so all the material is hosted online it just exists as a collection of Jupiter

notebooks on github and really off the bat the first thing that you’ll see is that it doesn’t really look that

different from what a lot of other websites do you know it’s got like a left table of contents it’s got a right within page table of contents but if you

look at the contents of each page itself you can see that there was a notebook under the generated all of this stuff so

this page itself is actually in a Jupiter notebook on github and it was rendered into the static HTML form with

all of the outputs populated and whatnot using Jupiter book and so there are some things in here

that you can start to play her out with and give the authors a little bit more

control over like what kind of interactive interactivity they want to insert I don’t know that the inferential

thinking textbook makes use of this but I’ll show you an example of this later on but for example if you want to

include pngs from your book if you want to use MATLAB or Seaborn you can do that

you can also include interactive outputs things that would normally output like HTML and JavaScript in the Jupiter

notebook so things like mocha or plotly or Altair those will also render just

fine in Superbook as well so you can use it to end that interactive content in your books okay so that being said I

want to quickly give a note of warning that there may be can some confusion

that I’m introducing here because as I mentioned before there’s a new version of Jupiter book that’s coming out quite

soon and I decided that I think it would be neat more interesting and potentially were youthful to talk about the new

version of Jupiter book as opposed to focusing on the newer version of Jupiter book that said the new version of

Jupiter book is still like very much so in an early beta kind of form so some things might change some things might be

a little bit different than if you just go to Jupiter book morgue but by and large the experience is pretty much the

same I think that the user facing workflow for Jupiter book shouldn’t change very much when it moves to new

build system would be hopefully a new faster more feature full back-end that’s

that’s working under the hood and I’ll go into a little bit of detail about that later so how do you create a

Jupiter book well in order to have in order to build a Jupiter book you basically need two things

o you need three things you need a collection of notebooks and markdown files that’s your book content you need

a table of contents file that just defines the structure of your book and then there’s an optional configuration

file that just allows you to you know turn on different kinds of features and things like that and so just as a little

helper Gepetto book comes bundled with a create function it basically just creates a

little lightweight template book that you can use – you know build on top of or you could

just use to see how about the structured so then you can adapt your own contents it’s a Jupiter book so if you type in at

the command line Jupiter book and create my book name I should say this is after you know pip installing Jupiter book once you run

Jupiter but create and then give your book a name it’ll generate a collection of files and a new folder for you and

those files are basically the three things that I just mentioned before so first off in your config yamo file it’s

just a collection of key-value pairs that define things like the title of your book who’s the author do you want

to show the sidebar by default or not you want to include binder links along with your book content there’s also a

table of contents file and this is the file that actually defines the structure of your book so the very first entry

here is going to be the first page the people land on you can define sub pages for any given page and then that allows

you to kind of nest sections within each cell and then files are just the path to

a given either I play a new file or a markdown file relative to your books

folder and then finally just have a collection of content files as I mentioned before we want that to either

be markdown files or Jupiter notebook files so I have a question for you Chris

relate up to this can you execute Jupiter book created sort of creates this cookie cutter template for you

that’s makes like a starting at the start of your book right when and then

it creates a table of contents for you how much is in that like is it just these exact files or does it create like

extra chapters and sort of subsections for you I will show you in a second oh

[Laughter]

okay so once you have like this set of content in a folder it’s really all you need to actually build a book so when

you run Jupiter book build and you point it to a folder that has that content inside of it what it does is it collects

all of those content files and then it renders them as a static HTML site that

you can then you know either deploy it online nullify or gh-pages whether you can just browse locally to see how things look

and importantly you want to make sure that you keep like all of the information that’s there in a notebook like the inputs the outputs eventually

you want to also include extra features for publishing like citations and cross-references and whatnot so after

you build your book you went from what was originally this collection of configuration and content files and now

we’ve created a new folder called build right now there will be one output in

that folder for HTML but in the future there’s going to be other kinds of outputs for things like you know PDF or epub or docx that kind of thing but

everything that’s inside of HTML is just a fully functioning HTML website ok so

after having shown you this but a bunch of static information I’m gonna give this a shot to actually show you how it

looks so I’ve just got a little Jupiter lab instance running here and I have an

empty folder and if I Jupiter Jupiter well JB is sort of a shorthand for

Jupiter but called do the whole thing create my cool book then it generates a

folder my cool book and if I look inside that folder I can see that I basically have exactly the files that I just

showed before there’s one extra file that I didn’t mention which is a big check file which I can show some some

functionality I’m not going to cover the exact structure of these because I already told you about it before but

this is the configuration file this is the table of contents file it’s just a light light a lightweight little one

that points to these files that are here and then the content that’s inside of these things is just like your regular

old content that you would expect to see in a notebook so here’s a regular

Jupiter notebook and whatnot so now if I go to if I change into my book folder

inside Jupiter book build and I was pointed to the current folder that I’m in then under the hood it’s actually

using Sphinx as I had mentioned before it collects all of those files it resolves some stuff about the content

that’s inside and then it generates this build folder right here so if I look inside of the build folder this is

a functioning static website and in fact I can now go to Chrome build HTML and

open up my index file and that will show the Jupiter book that I just built again

it’s very bare-bones and simple because it’s just there to sort of show you how things are meant to look just to browse

really quickly if I go to this content with notebooks file if you remember this is B this is the notebook that was used

under the hood for this file and this is how it renders on the actual page itself it looks good do you notice they don’t

have any outputs here because I haven’t run them in this page if I were to run everything here now I’ve got the outputs

there if I rerun Superbook build and

then refresh this page now I’ve got outputs okay so it’s dependent on

whether you execute the notebook went before you execute build what the outputs are gonna look like in the HTML

the rendered HTML pages it is currently that’s only because we haven’t built in

the execution feature yet like automatic

execution of notebooks that need it basically okay so another thing I noticed is on the example book that you

showed us on a little bit earlier there was a link to interact with binder and

also a download link and this when you just showed only has the download link

for the notebook on the HTML that you just showed us so does that this binder

only hook up once it is a rendered page on gh-pages or something like that or is

it elsewhere that one though so that’s actually a feature that already exists I just haven’t turned it on yet then it’s

a related project that we’re working on it’s kind of another one of the under the hood pieces but a lot of Matic those

binder links for you so that that’s there as well but what minigame is referred to is for any of the pages that

have a Jupiter notebook under the hood if you know the github repository where

those notebooks are stored then you can automatically inject a binder link notebooks and then you’ll be able to use

that to immediately send users to like a fully interactive version that’s running on a Jupiter server and the cloud would

binder yeah it’s nice of those integrates so easily together as well

yeah I mean I think that a lot of the a lot of the reasons for that is that you know one of the philosophies the design

philosophies of the Jupiter project is is to try to design fairly modularly and adopt like common standards and

protocols across the different pieces of the project so that you know on your hub which is a service that runs something

like my binder not Ward also has the ability for other things to hook into it programmatically so basically I think

you see I’ll just show you that in the current Jupiter but Cupid org has the

current version of it so here you see these interact buttons up here if I am on a page that’s got a notebook under

the hood I’ve configured this Jupiter book to say hey the the collection of

notebooks is this github repository and the notebooks is at this subfolder in there and so then Jupiter book knows how

to build up these interact links programmatically and just send me over to a binder instance if I were to click

on that so I’m not going to wait for the further the binders and spin up but you can kind of see how a reader can really

quickly like jump into this interactive version of it if they want to okay so

that’s like a very quick link here’s how you go from zero to a working Jupiter book in 50 seconds is should I pause for

questions there anything like that or do you want me to keep on going forward you can keep going we have questions a little bit later in the in okay thing

okay cool so in that case for the next piece of

this I want to talk about where the Jupiter book project is moving in the

future a lot of what I just described some parts of it were this like new build system that I did that I had

mentioned but a lot of it was just thick

I think we I think Christmas from

Verizon mm-hmm I think just have a chat so he comes back yeah hopefully

everything is okay now might be you know what I might be yeah I don’t know

exactly what happened but we’ll see what happens Oh Christy back we lost you for a second hi first you forgot it was like right

when you started talking about the executable book friend except your number that would be great so the exact

a book project is a sloan Foundation funded project it’s a collaboration between four different

python-based publishing projects so IAB is an integrative biology project at

Northern Arizona University headed by Greg capresso on the Left quant econ is

an open textbook for quantitative economics which is run out of Australian National University and that’s John Sturges key in the middle who’s the PI

on that project and then Krystal is recently graduated from Imperial College

London and he’s been running a project on publish which tries to bring in more traditional publishing features into the

Jupiter ecosystem and then obviously there’s the Jupiter book project coming out of UC Berkeley and so the goal of

this grant is to basically align the interests and the lines of stacks of all these different projects and try to have

like a single unified Python stack for publishing as opposed to each of us having our own separate pieces and so

this is the project that’s kind of driving a lot of the new developments it’s going to make its way into the next

version of Jupiter book you can find just like an NEHTA website for the project that EBP jupiter book or thanks

for executive old book project so really quickly i want to talk about some of the

interesting features that are coming up in the pretty near future probably the next like month or so in what I’m just

calling Jupiter book 2.0 so really quickly just to start off I wanted to

show like the build chain that I’m talking about here as I showed you before it was it was sort of going

straight from like static I plan and markdown files into a static website and that process is largely going to be

the same we’re hopefully a few extra pieces in the middle so first off we still want people to be able to write

their content either in Jupiter notebook files or in markdown files but then we

want to have a step where they can execute those files and potentially cash their outputs so like a separate cache

that Jupiter book knows how to use and then we want when we actually build the

site to be able to draw from that cache if needed so that we don’t need to reiax acute the notebooks every single time

that we build the site and that way we know that the outputs that are there are like the latest outputs that that should

be there but we don’t have to wait for execution every single time to be a little part and that’s actually important because projects like quant

econ I think they have many many many dozens of pages and it takes something like two and a half hours to run all of

the material for their whole book and so for some of these larger more complex projects we need to have some ability to

cache the outputs and check whether or not those outputs are stale before we build the book and then secondly once we

have all of these built books we then want to grab some more like rich publishing style information from them

so we want to look for things like cross references between pages equation references citations we want to pull out

all of that rich metadata it’s like mix up a proper multi-page document and then

be able to use all of that when we generate the final HTML website and then

finally once we have that in rich document model we want to have multiple kinds of documents so if you want a

static web played like what you think currently does you can do that if you want to have a PDF that looks really

nice and professional and maybe flows through la tech you can how about I think that people are working on some

some docx and some epub outputs as well so that’s kind of at a high level what

the new build process is going to look like and as I mentioned before I think

one of the goals of the reworking of the back end is to use a little bit more of

a modular architecture in the Jupiter and the Python ecosystem so currently the 1.0 version of jupiter book is

basically a sing repository on github it does a ton of different kinds of things right it does

like new book page mill conversion it stitches them all together into a website and it adds all of the bells and

whistles around like interactivity and binder links and things like that what we’d like to do now is build on top of

sphynx which is a really excellent documentation generator and Python ecosystem and basically what we’ve

discovered is that if you want to build like a high quality publication style output then a lot of the features that

you need for that are pretty similar to some of the features that you need for just building high quality technical documentation for your projects and so

our hope is that by using Sphinx we’ll be able to align our stack more closely with the broader Python ecosystem

so rather than largely relying on a tool in the Ruby stack which many of us don’t

know as well and we have less of an ability to contribute you know upstream improvements to we can use Fink’s we can

modularize a lot of the pieces of legibility does into Sphinx extensions that the broader Python community can

use on their own now we’ll get into some details about that in a second and hopefully make this project feed into

that kind of broader open Python ecosystem as well so a couple of things

that that brings to mind them one of them is so I mentioned Sphinx here and

for those of you who are familiar with Sphinx you might have thought sinks doesn’t use markdown speaks uses restructured text where’s Jupiter

notebooks use markdown and that is true so that was one of our concerns with using Sphinx is that many authors are

just not familiar what’s writing in restructured text and so what we decided to do was build a very lightweight

flavor of markdown that is a first-class citizen in the sphinx ecosystem we call

this something called a mist or markedly structured text and basically the goal

of mist is just take jupiter notebook markdown the same markdown that we all write in jupiter and then add a couple

of syntactic pieces to let you use two of the most powerful tools and the sphinx ecosystem with our rules and

directives so really quickly for those of you who aren’t familiar rules the

directives they’re kind of like functions but in markdown so roll or a directive can have a name it

can have arguments it can have keyword value option configuration and then it can also have content and this sends the

general structure of each of those things so directives are more pie align blocks here you have a directive name

you can have arguments you can configure it and then you can add your content in their roles are in line with the text

itself so you have a role name and then you just give it some kind of interpreted text and these are patterns

they’re quite common in distinct ecosystem that allow you to control different kinds of behavior in your

final built HTML website so an example of each of these things is on the Left

this is how you could define like a figure in Jupiter book you would say okay I’m using the figure directive I

want to point it to a PNG file and I want to give it a caption and a label and name and then later on in a role

there’s a ref role which allows you to make a cross-reference to another item so say this is on a different page I can

insert my label here and because I’ve defined it over here in this directive then Sphinx will know how to

automatically insert a URL link that cross-references this figure on whatever page is over here and then it’ll link

you back to the page where it’s linked probably yeah exactly right nice that’s really cool oh yeah that was a good file

that you were doing earlier on yeah and I can I can show that up in a little bit if I if we’ve got some time for that for

the last section so let me let me try showing you what this looks like that

so I’m gonna go back to my to my page here this is my little mini build site

and I’ll edit this markdown file so a couple of examples of rules and directives let’s say that I’m writing

some content and then I want to include an extra little piece of information that’s like a sidebar sort of like a side notes that I don’t want to break up

my the flow of the main text of the page but I want to give like a little contextual piece of information to the

side for that I can use the sidebar directive so if I do sidebar like that and then I give it a title my sidebar

title here’s my sidebar content and I close

off my directive then when I rebuilt the book and go to my markdown page assuming

that that my internet works then I have a little sidebar content down here and

so this is a way you could sort of control the look in the feel and the structure of your page just by using

these little kind of lightweight markup pieces I can show you when you see I

think I can show you a citation as well so for if I before that there’s this big tech file inside the demo book which

right now because I’m apparently a vain person I’ve populated with all of the papers for my PhD thesis so this is one

way to get my agent so now if I wanted to cite one of those pages I’m typing

some content and now a citation I can use the cite rule and I just pass it the

label that’s inside that FinTech file pretty cool easy

and now when I rebuild my book and

refresh this then I’m typing some content a citation this links out to the

page with the bibliography is inserted right here at the bottom extremely cool that’s so nice

nice Vaizey yeah so so that’s like two examples of how you can use this little

lightweight extra markup syntax to control new kinds of functionality in

your page and the other kind of cool thing about that is that again these are kind of like functions which means that

people can extend them in lots of different kinds of ways and actually the Sphinx ecosystem has a huge collection

of extensions and tools and pieces that are already out there for extending the functionality

of Sphinx itself and thus by extension jupiter book for example that that

citation syntax that I was using that uses a Sphinx extension called speak spin tech to allow textile references in

your documentation extremely nice I just

want to say is something goes to deal with citations that like makes it so much easier features to talk about if

there’s full-time we’ve got time with leak proof a few features yes yeah okay

cool okay so then a couple of extra things one of them is as a part of this process

again modularizing the stack that’s there we also needed the ability to have

like a first-class citizen for rendering notebooks in science Fink’s and that’s a project that we’re calling this and so

this is a project that builds on top of the sphinx sorry the mist parser that i mentioned before and this allows you to

basically include jupiter notebooks directly into your Sphinx bill so there’s straight from I plan D into your

instincts documentation and has all the nice pieces for you know including outputs and things like that and again

that’s a tool that but the the mist parser as well as miss Denby those are

both completely modular tools that don’t inherently know anything about Jupiter bugs so the tool that anybody can use in

the Sphinx ecosystem so either write their documentation purely in markdown or to write documentation with Jupiter

notebooks and included in their own open source projects another piece of this

project is the upstream contributions to a tool called Jupitus joopa text is a really

interesting two way conversion tool between jupiter notebooks and various text-based formats so there is now a

jupiter a Jupitus extension for nist markdown and what that means is that you

can go from an iamb file here and have lossless two-way synchronization between

this file and a markdown representation of that file so basically like code

cells become tick-tick-tick code cell and you get a kernel the one thing that it doesn’t transfer

over are the outputs because that would be pretty heinous to try to put into a markdown file but I’ll get to that in

just a second and then another kind of interesting tool that that we’ve just

been playing around with things once you have the ability to represent all of this information in Spanx like across

multiple pages of your book then you can start to do some pretty interesting things so like one pattern that we’ve seen is that a lot of people they run

their analyses in one notebook and they’ll have this kind of like negative complex a bunch of different you know a

bunch of Python or R code inside tons of different plots tons of statistics that they’re calculating and then in another

notebook or in another markdown file they’re actually writing the document itself and so we tried to sort of enable

that a little bit more with a little sub module that we called glue and what glue allows you to do is embed snippets of

information in a notebook and attach them to a key so that later on in your

book you can reference that key and insert its value into whatever other page you’re working from and so just a

really quick example of what I’m talking about there so in this figure here I generated this plot you know a bunch of

lines and attached to it is this map plot little figure object if I do from

mist and be import glue and then I say glue my cool figure and then pass it

Figg else that display is false so if display is true then it’ll actually try to display the figure when you glue it

just is like a sanity check what I’ve just done is I’ve taken this figure and

I’ve attached it to this key name inside the notebook and now well then let’s

let’s do another thing here so you have some good up here right so I’ve got data let’s just calculate the mean of my data

and I’ll say mean equals me and then

I’ll glue my mean to the variable I just created called

me so now I’ve glued two different keys

into my notebook my cool figure and my meet if I go back into this markdown

file here then I can do things like that

so use the glue directive and pass it

the key that I added in this notebook so my cool figure let’s just see what

happens there and let’s add some extra stuff to my cooler here and here’s a cool caption

Chris we just go to probably one more minute on this second sir yes and then

here is my statistic and then my mean

and now if I rebuilt the book okay if I

rebuild the book then glue normally but I’m gonna skip this piece because I

don’t think that we have time to go through it so anyway that kind of like gives you again this is all the very

alpha content and as I mentioned before this is sort of like where we’re trying to get to is increasing the ability to

sort of enrich the the relationships between the different pages that are

there in the various content pages and so the last thing I’ll just mention is

if you’re interested in trying any of this stuff out then what I recommend is checking out betas like the newer kind

of build interface for Jupiter book some of those modular components instincts that I mentioned the mist parser and

mist NBER both of those links and then if you just want to follow the project at large at the bottom of the page those

are executive old book projects as well awesome thanks for my flat crease that

was really enjoyable and I wish we had that when I was at university because I was just staring at boring lecture

slides and they were going through them but it’s great to see so that now that we’ve discussed about where the

project is heading so as a roadmap section we’re going to move on to the Q&A section soma Deccan do you want to

keep the first question off yeah okay so our first one is from Phillip abhi

follow-up statement to both your statement regarding the future interaction between Jupiter book and

Spinks notebooks are mostly a markdown based well Sphinx is mostly restructured

test I know that recently syncs allows markdown – isn’t this adding an additional layer of complexity and then

they have a clarification is the EBP going to be Jupiter book 2.0 are they

going to be two separate projects thank you yes so two the first one it definitely

is adding a layer of complexity in the sense that there is like I would say it’s more like cognitive complexity I’m

the users side because it’s another thing for them to think about the goal

of markdown we’ve all of adding markdowns of Sphinx was really trying to stick with the most common version of

markdown that’s out there which is called common mark which is closer to a community standard than anything else

that exists in markdown ecosystem and then just adding like a few extra pieces that are needed to get it working

instincts one of the challenges that we ran into was we wanted to utilize the

Sphinx ecosystem but we found that for a lot of people writing and restructured text was just like a non-starter

personally I read in restructure text a lot and I enjoy it but especially for a

lot of our introductory users again a lot of people that are using this stack or like you know professors who are not

super comfortable comfortable with like the deep guts of the Python ecosystem but they’re very familiar reading

markdown because they use github and things like that so really our goal was a few lightweight pieces on the Suites

ecosystem to allow people to write a documentation purely in markdown and

really like Sphinx itself is meant to be extensible in this way so I think we’ve

lost Chris again we can pause for a minute and hopefully he’ll come back um but you know while we’re waiting if

any of you have any more questions please add them to the thing we’ve just

lost Chris he’ll be back I’m sure he’ll be back this is gonna be good but thank you for your precision so far

everybody this has been really great it’s been an old episode it’s very

enjoyable no curse copy and chris is back yeah Chris you’re back so so I

think I think that where I was caught up was like in many ways Fink’s is meant to be expensable extensible in this way so

it’s just swings builds in a lot of different hooks so that you can add new kinds of functionality restructuredtext

is the markup language that they went with by default but they very intentionally wanted it to be possible for people to write and other kinds of

markup languages and so that’s what the mist parser is a parser for sphinx that’s an extension of a sphinx

ecosystem for markdown so to answer your second question basically the executive

Oh book project is like a meta project under which it’s basically a collaboration that is explicitly attached to this grant that project will

contribute to and build a number of tools in the open-source Python ecosystem miss 10b miss parser Sphinx

but also just up streaming a lot of a lot of pieces – like Cupid text the Jupiter Yuka system and whatnot Jupiter

book will I think be a more kind of controller process over these different pieces and I think that eventually the

executive book project will dissolve because the grant will run out and at some point in the next you know six

months or so the grant goes for another three years but at some point of the next six months or so we’re gonna pivot

more of our effort towards like building out the open-source communities around these projects up streaming as many

contributions as we can to other projects and the Jupiter and the Python Sphinx ecosystem and you know trying to

build a more proper like sustainable open-source community around oh so super nice yeah question in the

question box from Sophia yang can I use Jupitus with Jupiter book so that my

content files would be dot PI files instead of ipython notebook bios yes yeah it’s not in there yet in the new

version but eventually yeah but we want this for anything that is like a valid joopa text input should also work with

Jupiter book I think that the missed markdown flavor of Jupitus will be

treated in some kind of special way probably for the purposes of caching the outputs and storing them for later on so

that you know when you basically when you rebuild a book it’ll first check and say okay which pages have you edited and in those pages

did you edit any of her code if not then I don’t need to rerun all the code I could just use the cached outputs but if

so I need to re execute it I think some of that logic might be like Jupiter notebook or NIST notebook specific but

in general yeah we want to support anything that you put textual support awesome great thank you for that and

thank you very much for your questions everyone listening right so now it’s time for our famous rant and rave

section where we both get 15 seconds to either rat or rave about what ever topic

we want so Chris it’s your soapbox yeah so I think that for me the thing that I

wanted to rave about is I guess an offense also related to Kairos but for

example I think one of the things that I really enjoy about this is that you get

a different kind of perspective into people’s lives than you normally do when you’re just meeting them in conference

rooms and whatnot and one of the things that I particularly enjoy is watching the number of dogs cats and children

that decide to come in and like pawing at their their parents faces in the middle of zoom chats and Skype

meetings and whatnot I don’t know why but I think that I find it like a nice kind of way to personalize what is a

difficult situation for a lot of people and working from home not seeing a lot of other people and it brings me a

little bit of happiness to get those little interruptions whether they’re there once it or not on the other end I

enjoy them um okay well I’d like to rave

about libraries and librarians I think right now we all are thinking about ways to get the content we want while we’re

at home and there’s a lot of really great resources going around that libraries offer that are not in person

so you don’t just have to get books in person there’s lots of really good apps where you can download books but you can

also download things like comic books or audio books and libraries are like collecting a lot of resources related to

that so I’d like to thank all of our librarians they’re amazing and thank you for being great thank you very much

chris is sort of in line with you but I have a bit of a rant and it’s more a hypothesis now that everyone’s working

from home I just look at snapchat I look at social media and everyone seems to be more healthy and really enjoying working

from home so I just wanted to rant by the fact that I think there’s gonna be a bit of an uprising in a lot of companies

where people are going to demand now that they work from home so that’s my little rant but yeah thank you very much

that’s all we have time for today thanks for watching and thanks for listening you can find us on twitter at at

and at con site AI so if you’re interested in also funding open-source

projects including jupiter book you can find all the project roadmaps at Quonset comm slash projects chris so where can

people find you in jupiter book yeah so me personally and just see whole graph on twitter as well as on github I spend

a lot of my time there as well as a veggie burger a community forum that discourse that Jupiter or not orgy the

Jupiter book project I shared a couple of links before and could include those and you know whatever materials we

follow up with on this webcast but Jupiter book that work for the current version of Jupiter book beta Jupiter

book work for the the new version of it and then github.com / executive o book

project is just the place where we are like managing the process of the project

itself and you can kind of keep up to date with some of the movement there if you’re interested in following along or

contributing as well great thank you Chris thank you for your time and thanks everyone for listening join us again

episode 4 a pointed discussion on Apache arrow alright thank you very much

[Music]

Committed to your success with open source. OpenTeams is your easy point of access to a range of services from our open source expert network, from commercial open source support to open source training, staffing & recruiting services, and more.

Resources

OpenTeams