{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Extracting Ancient Placenames using Named Entity Recognition and `spaCy`\n", "\n", "In this notebook, we are going to explore an important subfield of natural language processing, named entity recognition or NER. This notebook was inspired by William Mattingly's textbook on the subject, which you can find here: https://ner.pythonhumanities.com/intro.html. I encourage you to browse this book and play around with the code that he provides.\n", "\n", "Objectives:\n", "* Understand what NER is and why it matters\n", "* Collect a training data for a machine learning NER approach\n", "* Train a NER model using `spaCy`'s utilities\n", "* Do some basic inference to see where our model succeeds and where it fails" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "What is NER and why does it matter?\n", "\n", "Named entity recognition describes any method which uses computational methods to extract from unstructured text names of people, places or things. It is a hard classification task, meaning that every word in a document is either a type of named entity or it is not. For example in the following sentences:\n", "> My name is Peter Nadel. I work at Tufts University.\n", "\n", "the token 'Peter Nadel' could be tagged as a PERSON tag, where as Tufts Univerisity could be tagged with a PLACE tag. Importantly, in NER, no token can receive more than one tag.\n", "\n", "As a result, NER can be using in a wide variety of fields and applications." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# The example we'll be looking at today comes from these maps that I made\n", "from IPython.display import IFrame\n", "\n", "IFrame(\n", " \"https://tuftsgis.maps.arcgis.com/apps/webappviewer/index.html?id=576ff8f0e3954ad781916e94dfb34f7e\",\n", " width=1400,\n", " height=800,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "To make a map like the one we see above, I needed to extract place names from an ancient text. Thankfully, ToposText (https://topostext.org/) has marked all of the place names already. So why do we need NER?\n", "\n", "ToposText has a lot of texts, but it doesn't have every ancient text. I work quite frequently with faculty members from Classic Studies, examining texts much more obscure than those in ToposText, which may not be digitized at all, let alone have place names marked. So, what we need is an ancient place name NER model, which we can pass any amount of texts and get back all of the places in that text.\n", "\n", "*A quick note on `spaCy`'s native English NER*: `spaCy` has NER built into their English and other models. This NER is of good quality for modern texts, but is quite poor when applied to texts that aren't that similar to the training data (in the English model's case: modern news articles scraped from the web). For specialty tasks like the one we will do today, training your own model will always return better results." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Scraping\n", "\n", "Here, I will show how we got our training data and what form it has to be in for training NER models with `spaCy`." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from bs4 import BeautifulSoup\n", "from tqdm import tqdm\n", "import re\n", "import requests" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "res_html = requests.get(\n", " \"https://topostext.org/work/148\"\n", ").text # The Natural Histories, Plinius Senior" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "topos_test = BeautifulSoup(res_html, features=\"html\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "p_tags = topos_test.find_all(\"p\")[2:]\n", "p_tags[3].a # first a tag in the third p tag" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "`spaCy` expects the training data to be in `jsonl` format, meaning that there must be a list where each element is a list that contains the text and a dictionary of the entities that are in the text, with their label (in this case, their HTML class)." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import collections\n", "import string\n", "\n", "TRAIN_DATA = []\n", "\n", "for i, p in enumerate(p_tags):\n", " document = []\n", " p_text = re.sub(\"(§\\s\\d\\.\\d+)|(§\\s\\d+)\", \"\", p.text)\n", " document.append(p_text.replace(\"\\xa0\\xa0\", \"\"))\n", "\n", " ent_dict = collections.defaultdict(list)\n", " for a in p.find_all(\"a\"):\n", " span_list = []\n", " if a.text.istitle():\n", " if \"href\" in a.attrs:\n", " span_list.append(\n", " a.text.translate(str.maketrans(\"\", \"\", string.punctuation))\n", " )\n", " span_list.append(a[\"href\"].split(\"/\")[1])\n", " else:\n", " span_list.append(\n", " a.text.translate(str.maketrans(\"\", \"\", string.punctuation))\n", " )\n", " span_list.append(a[\"class\"][0])\n", " if len(span_list) > 0:\n", " ent_dict[\"entities\"].append(span_list)\n", " document.append(dict(ent_dict))\n", " TRAIN_DATA.append(document)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "TRAIN_DATA[0]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Importantly though, you can't just use the text of the entities. It must be the indices of the names within the text. So we can do that below." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## TASK 1\n", "We can't feed our model the raw text of a name. Instead, we need to give it the indices of the name in the original text.\n", "\n", "Let's write some code to do that." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test = TRAIN_DATA[0]\n", "test" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "type(test)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[0], type(test[0])" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[0].find(\"Muses\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1], type(test[1])" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"], type(test[1][\"entities\"])" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"][0], type(test[1][\"entities\"][0])" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"][0][0], type(test[1][\"entities\"][0][0])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "My Solution" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[0].find(\"Muses\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"][0][0]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "start = test[0].find(test[1][\"entities\"][0][0])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Answer\n", "`TRAIN_DATA` -> `List[String, Dict[List[String, String]]]`" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[0].find(\"Muses\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "test[1][\"entities\"]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# using find method\n", "test[0].find(test[1][\"entities\"][0][0])" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# add length of the original string\n", "start = test[0].find(test[1][\"entities\"][0][0])\n", "end = start + len(test[1][\"entities\"][0][0])\n", "start, end" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# check\n", "test[0][396:401] # looks good!" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# turn this into a function we can use for each string\n", "def names2indices(train_data_instance):\n", " for i, name in enumerate(train_data_instance[1][\"entities\"]):\n", " # get indices\n", " start = train_data_instance[0].find(name[0])\n", " if start != -1:\n", " end = start + len(name[0])\n", " else:\n", " end = -1\n", " pass\n", "\n", " # update dict\n", " train_data_instance[1][\"entities\"][i].pop(0) # remove string name\n", " train_data_instance[1][\"entities\"][i].insert(0, end) # add end\n", " train_data_instance[1][\"entities\"][i].insert(0, start) # add start" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# do it to all of them in our training data\n", "for instance in TRAIN_DATA:\n", " if \"entities\" in instance[1].keys():\n", " names2indices(instance)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "TRAIN_DATA[0]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Completing our training data\n", "\n", "Now that we can do this to one ToposText URL we can do it to all the others." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# can make this into a single function\n", "def create_TRAIN_DATA(url):\n", " \"\"\"\n", " Takes in a ToposText URL and creates spaCy training data.\n", " \"\"\"\n", "\n", " # scraping\n", " res = requests.get(url)\n", " if res:\n", " res_html = res.text\n", "\n", " soup = BeautifulSoup(res_html, features=\"html\")\n", " TRAIN_DATA = []\n", "\n", " p_tags = soup.find_all(\"p\")[1:]\n", " for i, p in enumerate(p_tags):\n", " document = []\n", " p_text = re.sub(\"(§\\s\\d\\.\\d+)|(§\\s\\d+)\", \"\", p.text)\n", " document.append(p_text.replace(\"\\xa0\\xa0\", \"\"))\n", "\n", " ent_dict = collections.defaultdict(list)\n", " for a in p.find_all(\"a\"):\n", " span_list = []\n", " if a.text.istitle():\n", " if \"href\" in a.attrs:\n", " span_list.append(a.text)\n", " span_list.append(a[\"href\"].split(\"/\")[1])\n", " else:\n", " try:\n", " span_list.append(a.text)\n", " span_list.append(a[\"class\"][0])\n", " except:\n", " continue\n", " if len(span_list) > 0:\n", " ent_dict[\"entities\"].append(span_list)\n", " document.append(dict(ent_dict))\n", " TRAIN_DATA.append(document)\n", "\n", " # reformatting\n", " for instance in TRAIN_DATA:\n", " if \"entities\" in instance[1].keys():\n", " names2indices(instance)\n", "\n", " return TRAIN_DATA" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "create_TRAIN_DATA(\"https://topostext.org/work/148\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "TRAIN_DATA = []\n", "# Max of 901\n", "for i in tqdm(range(2, 901)):\n", " train = create_TRAIN_DATA(f\"https://topostext.org/work/{i}\")\n", " for item in train:\n", " TRAIN_DATA.append(item)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "len(TRAIN_DATA)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "TRAIN_DATA[:3]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# pickling let's us save this work so we don't have to do it again\n", "import pickle\n", "\n", "file_name = \"topostext_training_data.pkl\"\n", "\n", "# pickle and save the data\n", "with open(file_name, \"wb\") as file:\n", " pickle.dump(TRAIN_DATA, file)\n", "\n", "print(f\"Training data saved to {file_name}\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Training\n", "Now that we have our data in the correct form, we can start training.\n", "\n", "First, we have to convert the `TRAIN_DATA` object into a special `spaCy` type called a `DocBin` (code taken from Mattingly)." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!wget https://tufts.box.com/shared/static/2p4ez2vas90v2zn8mf3lgdv6lltzl75d.pkl -O topostext_training_data.pkl" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import pickle\n", "\n", "with open(\"/content/topostext_training_data.pkl\", \"rb\") as file:\n", " loaded_training_data = pickle.load(file)\n", "\n", "print(\"Loaded training data length:\", len(loaded_training_data))" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "loaded_training_data[0]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The above section in the original Greek.\n", "\n", "![Screenshot 2023-10-16 at 10.00.16 AM.png]()" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!mkdir data" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import spacy\n", "from spacy.tokens import DocBin\n", "from pathlib import Path\n", "import warnings\n", "\n", "\n", "def convert(lang: str, TRAIN_DATA, output_path: Path):\n", " nlp = spacy.blank(lang)\n", " nlp.max_length = 2000000\n", " db = DocBin()\n", " for text, annot in TRAIN_DATA:\n", " doc = nlp.make_doc(text)\n", " ents = []\n", " if \"entities\" in annot.keys():\n", " for start, end, label in annot[\"entities\"]:\n", " span = doc.char_span(start, end, label=label)\n", " if span is None:\n", " msg = f\"Skipping entity [{start}, {end}, {label}] in the following text because the character span '{doc.text[start:end]}' does not align with token boundaries:\\n\\n{repr(text)}\\n\"\n", " warnings.warn(msg)\n", " else:\n", " ents.append(span)\n", " try:\n", " doc.ents = ents\n", " except:\n", " continue\n", " db.add(doc)\n", " db.to_disk(output_path)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from sklearn.model_selection import train_test_split\n", "\n", "# two splits: first split (train and validation for testing our model after it trains)\n", "train_valid, valid_post_training = train_test_split(\n", " loaded_training_data, test_size=0.2, random_state=1\n", ")\n", "\n", "train, valid = train_test_split(train_valid, test_size=0.2, random_state=1)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "len(train), len(valid), len(valid_post_training)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import warnings\n", "\n", "warnings.filterwarnings(\"ignore\")\n", "\n", "convert(\"en\", train, \"/content/data/train.spacy\")\n", "convert(\"en\", valid, \"/content/data/valid.spacy\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import os\n", "\n", "os.listdir(\"/content/data\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## TASK 2\n", "\n", "We still need one final piece before we can start training. This is the config file that will tell the training program how to initialize our set up. We can download a blank version here: https://spacy.io/usage/training.\n", "\n", "Finally we just need to add the file locations of our training and validation data. In Colab, double-clicking on the base_config.cfg file will open up an editor on the side. In the train field, put `data/train.spacy` and in the dev field, put `data/valid.spacy`." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!python -m spacy init fill-config /content/base_config.cfg /content/config.cfg" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# really useful feature in spacy\n", "!python -m spacy debug data /content/config.cfg" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!python -m spacy train config.cfg --output models" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# zip up model for easier download\n", "!zip -r topostext_ner_model_full.zip models" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from google.colab import files\n", "\n", "files.download(\"topostext_ner_model_full.zip\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Inference\n", "\n", "Now that we have a model, we can see how well we did. For this inference step, I want to use example texts that I know were not in the training data." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!git clone https://github.com/pnadelofficial/topos_text_NER.git" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!unzip topos_text_NER/topostext_ner_model_full.zip" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "inf_text = \"\"\"\n", "An upper vest, once Helen's rich attire,\n", "From Argos by the fam'd adultress brought,\n", "With golden flow'rs and winding foliage wrought,\n", "Her mother Leda's present, when she came\n", "To ruin Troy and set the world on flame;\n", "The scepter Priam's eldest daughter bore,\n", "Her orient necklace, and the crown she wore\n", "Of double texture, glorious to behold,\n", "One order set with gems, and one with gold.\n", "\"\"\"\n", "\n", "# Vergil Aenid 1.643\n", "# perseus.tufts.edu/hopper/text?doc=Perseus%3Atext%3A1999.02.0052%3Abook%3D1%3Acard%3D643" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import spacy\n", "\n", "trained_nlp = spacy.load(\"models/model-best\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "doc = trained_nlp(inf_text)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "type(doc)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "people = []\n", "for ent in doc.ents:\n", " if ent.label_ == \"people\":\n", " people.append(ent)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "people" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "!python -m spacy download en_core_web_sm" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import spacy\n", "\n", "nlp = spacy.load(\"en_core_web_sm\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# spacy pretrained model\n", "test_doc = nlp(inf_text)\n", "[(e, e.label_) for e in test_doc.ents]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# ours\n", "[(e, e.label_) for e in doc.ents]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Evaluation\n", "\n", "In this section, we'll use the `valid_post_training` list of examples to calculate an F1 score for our model. This score is a good summary of how well our model does on this NER task, as it not only counts what entities the model correctly identified, but also considers if their labels are correct as well.\n", "\n", "F1 score is the harmonic mean of *precision*, the ratio of correctly identified entities (true positives, TP) and all identified entities (the sum of true positives and false positives, FP), and *recall*, the ratio of correctly identified entities (TP) and all true entities (true positives and false negatives, FN).\n", "\n", "$Precision = \\frac{TP}{TP+FP}$\n", "\n", "$Recall = \\frac{TP}{TP+FN}$\n", "\n", "$F1 = \\frac{2*Precision*Recall}{Precision+Recall} = \\frac{2*TP}{2*TP+FP+FN}$" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "len(valid_post_training)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "valid_post_training[0]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import spacy\n", "\n", "trained_nlp = spacy.load(\"models/model-best\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "doc = trained_nlp(valid_post_training[0][0])\n", "predicted_ents = [(d.text, d.label_) for d in doc.ents]\n", "predicted_ents" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "true_ents_raw = valid_post_training[0][1][\"entities\"]\n", "true_ents = []\n", "for t_e in true_ents_raw:\n", " text = valid_post_training[0][0][t_e[0] : t_e[1]]\n", " true_ents.append((text, t_e[-1]))\n", "\n", "true_ents" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "predicted_set = set(predicted_ents)\n", "true_set = set(true_ents)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "true_positives = len(predicted_set.intersection(true_set))\n", "false_positives = len(predicted_set - true_set)\n", "false_negatives = len(true_set - predicted_set)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "precision = true_positives / (true_positives + false_positives)\n", "recall = true_positives / (true_positives + false_negatives)\n", "f1_score = 2 * (precision * recall) / (precision + recall)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "precision, recall, f1_score" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "def eval_one(i):\n", " doc = trained_nlp(valid_post_training[i][0])\n", " predicted_ents = [(d.text, d.label_) for d in doc.ents]\n", "\n", " true_ents_raw = valid_post_training[i][1][\"entities\"]\n", " true_ents = []\n", " for t_e in true_ents_raw:\n", " text = valid_post_training[i][0][t_e[0] : t_e[1]]\n", " true_ents.append((text, t_e[-1]))\n", "\n", " predicted_set = set(predicted_ents)\n", " true_set = set(true_ents)\n", "\n", " true_positives = predicted_set.intersection(true_set)\n", " false_positives = predicted_set - true_set\n", " false_negatives = true_set - predicted_set\n", "\n", " return len(true_positives), len(false_positives), len(false_negatives)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from tqdm import tqdm # for the progress bar\n", "\n", "all_true_positives = 0\n", "all_false_positives = 0\n", "all_false_negatives = 0\n", "\n", "for i in tqdm(range(len(valid_post_training))):\n", " if \"entities\" in valid_post_training[i][1]:\n", " tp, fp, fn = eval_one(i)\n", " all_true_positives += tp\n", " all_false_positives += fp\n", " all_false_negatives += fn\n", "\n", "precision = all_true_positives / (all_true_positives + all_false_positives)\n", "recall = all_true_positives / (all_true_positives + all_false_negatives)\n", "f1_score = 2 * (precision * recall) / (precision + recall)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "round(f1_score, 3) # not too shabby!" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Yay! We did it!" ] } ], "metadata": { "colab": { "collapsed_sections": [ "EX_0ickOW0CF" ], "provenance": [] }, "kernelspec": { "display_name": "Python 3", "name": "python3" }, "language_info": { "name": "python" }, "mystnb": { "execution_mode": "off" } }, "nbformat": 4, "nbformat_minor": 0 }