[notebooks] Add affiliation
author Vincent Michel <vincent.michel@logilab.fr>
Tue, 01 Jul 2014 15:10:34 +0200
changeset 458 9527d4b3d381
parent 456 d93286fdd149
child 461 d5dc8d2c4311
notebooks/Named Entities Matching with Nazca.ipynb
notebooks/Record linkage with Nazca - Example Dbpedia - INSEE.ipynb
notebooks/Record linkage with Nazca - part 1 - Introduction.ipynb
notebooks/Record linkage with Nazca - part 2 - Normalization and blockings.ipynb
notebooks/Record linkage with Nazca - part 3 - Putting it all together.ipynb
--- a/notebooks/Named Entities Matching with Nazca.ipynb	Tue Jul 01 14:43:49 2014 +0200
+++ b/notebooks/Named Entities Matching with Nazca.ipynb	Tue Jul 01 15:10:34 2014 +0200
@@ -14,8 +14,25 @@
       "<h1>Named Entities Matching with Nazca</h1>\n",
       "\n",
       "\n",
+      "This IPython notebook shows some features of the Python Nazca library :\n",
+      "<ul>\n",
+      "    <li> website : <a href=\"http://www.logilab.org/project/nazca\">http://www.logilab.org/project/nazca</a></li>\n",
+      "    <li> source : <a href=\"http://hg.logilab.org/review/nazca\">http://hg.logilab.org/review/nazca</a></li>\n",
+      "</ul>\n",
+      "<ul>\n",
+      "    <li> original notebook : <a href=\"http://hg.logilab.org/review/nazca/raw-file/cdc7992b78be/notebooks/Named%20Entities%20Matching%20with%20Nazca.ipynb\">here !</a></li>\n",
+      "    <li> date: 2014-07-01</li>\n",
+      "    <li> author: Vincent Michel  (<i>vincent.michel@logilab.fr</i>, \n",
+      "                                  <i>vm.michel@gmail.com</i>) @HowIMetYourData</li>\n",
+      "</ul>"
+     ]
+    },
+    {
+     "cell_type": "markdown",
+     "metadata": {},
+     "source": [
       "Named Entities Matching is the process of recognizing elements in a text and matching it\n",
-      "to different types (e.g. Person, Organization, Place). This may be related to Record Linkage (for linking entities from a text corpus and from a reference corpus) and to Named Entities Recognition."
+      "    to different types (e.g. Person, Organization, Place). This may be related to Record Linkage (for linking entities from a text corpus and from a reference corpus) and to Named Entities Recognition."
      ]
     },
     {
@@ -35,7 +52,7 @@
        "output_type": "pyout",
        "prompt_number": 1,
        "text": [
-        "<IPython.core.display.HTML at 0x7f706c036950>"
+        "<IPython.core.display.HTML at 0x1ceb950>"
        ]
       }
      ],
@@ -1351,12 +1368,12 @@
      "outputs": [
       {
        "html": [
-        "The discovery of the teenagers\u2019 bodies in the <a href=\"http://dbpedia.org/resource/West_Bank\">West Bank</a> prompted vows of retaliation by <a href=\"http://dbpedia.org/resource/Israel\">Israel</a>, which blamed the <a href=\"http://dbpedia.org/resource/Palestinian\">Palestinian</a> group <a href=\"http://dbpedia.org/resource/Hamas\">Hamas</a> for the killings.<br/><br/><a href=\"http://dbpedia.org/resource/The_South\">The South</a> <a href=\"http://dbpedia.org/resource/African\">African</a> track star\u2019s agent and friend testified that the couple\u2019s relationship was strong and that he did not intend to kill her.<br/><br/>The <a href=\"http://dbpedia.org/resource/Japanese_prime_minister\">Japanese prime minister</a> announced that his government would reinterpret the antiwar <a href=\"http://dbpedia.org/resource/Constitution\">Constitution</a> to allow the armed forces to come to the aid of friendly nations.<br/><br/>The first clues that led to the grisly discovery of the bodies came only hours after their abduction in the <a href=\"http://dbpedia.org/resource/West_Bank\">West Bank</a> was reported.<br/><br/>The lawmakers were under pressure to name an inclusive government as insurgents mount a violent challenge north and west of <a href=\"http://dbpedia.org/resource/Baghdad\">Baghdad</a>.<br/><br/>The only viable political future for the country is federation. But <a href=\"http://dbpedia.org/resource/America\">America</a>\u2019s first priority is to see <a href=\"http://dbpedia.org/resource/ISIS\">ISIS</a> crushed.<br/><br/><a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Petro\">Petro</a> <a href=\"http://dbpedia.org/resource/O\">O</a>. 
<a href=\"http://dbpedia.org/resource/Poroshenko\">Poroshenko</a> said he would resume full-scale efforts to quash the pro-Russian uprising in eastern <a href=\"http://dbpedia.org/resource/Ukraine\">Ukraine</a>.<br/><br/><a href=\"http://dbpedia.org/resource/Nicolas_Sarkozy\">Nicolas Sarkozy</a>, the former <a href=\"http://dbpedia.org/resource/French_president\">French president</a>, has been under scrutiny for possible financial irregularities in his <a href=\"http://dbpedia.org/resource/2007\">2007</a> campaign and for other alleged offenses.<br/><br/>A huge throng of people, mostly young, took to <a href=\"http://dbpedia.org/resource/Hong_Kong\">Hong Kong</a>\u2019s streets <a href=\"http://dbpedia.org/resource/Tuesday\">Tuesday</a>, defying <a href=\"http://dbpedia.org/resource/Beijing\">Beijing</a>\u2019s dwindling tolerance for challenges to its control.<br/><br/><a href=\"http://dbpedia.org/resource/Myanmar\">Myanmar</a> is enjoying some new diplomatic clout, leading <a href=\"http://dbpedia.org/resource/China\">China</a> to court the country as <a href=\"http://dbpedia.org/resource/Beijing\">Beijing</a> presses its territorial claims in the <a href=\"http://dbpedia.org/resource/South_China_Sea\">South China Sea</a>.<br/><br/>The last remaining <a href=\"http://dbpedia.org/resource/African\">African</a> teams in the <a href=\"http://dbpedia.org/resource/World_Cup\">World Cup</a>, <a href=\"http://dbpedia.org/resource/Algeria\">Algeria</a> and <a href=\"http://dbpedia.org/resource/Nigeria\">Nigeria</a>, were eliminated on <a href=\"http://dbpedia.org/resource/Monday\">Monday</a>, ensuring that the continent would once again remember the <a href=\"http://dbpedia.org/resource/2014\">2014</a> event for off-the-field squabbles.<br/><br/>As <a href=\"http://dbpedia.org/resource/Hong_Kong\">Hong Kong</a> prepared for its annual pro-democracy march <a href=\"http://dbpedia.org/resource/Tuesday\">Tuesday</a>, a survey of residents found more discontent than ever 
with the <a href=\"http://dbpedia.org/resource/Chinese_government\">Chinese government</a>\u2019s policies toward the city, especially among the young.<br/><br/><a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Petro\">Petro</a> <a href=\"http://dbpedia.org/resource/O\">O</a>. <a href=\"http://dbpedia.org/resource/Poroshenko\">Poroshenko</a> ended a 10-day cease-fire, saying that rebels had not put down their weapons and had persisted in attacking government troops.<br/><br/>At least <a href=\"http://dbpedia.org/resource/22\">22</a> people were killed in the firefight \u2014 all of them assailants, the military said. One soldier was injured.<br/><br/>The giant <a href=\"http://dbpedia.org/resource/French\">French</a> bank admitted to transferring billions of dollars on behalf of <a href=\"http://dbpedia.org/resource/Sudan\">Sudan</a> and other countries the <a href=\"http://dbpedia.org/resource/United_States\">United States</a> has blacklisted.<br/><br/>The former chief justice of the <a href=\"http://dbpedia.org/resource/Constitutional_Court\">Constitutional Court</a> was sentenced to life in prison for corruption, the heaviest sentence ever for graft in one of the most corrupt countries in the world.<br/><br/>A former aide to former <a href=\"http://dbpedia.org/resource/Prime_Minister\">Prime Minister</a> <a href=\"http://dbpedia.org/resource/Petr_Necas\">Petr Necas</a> who later married him was found guilty of abuse of power on <a href=\"http://dbpedia.org/resource/Monday\">Monday</a> in a scandal that exposed their affair and toppled the government a year ago.<br/><br/>The court found that in <a href=\"http://dbpedia.org/resource/1973\">1973</a> an <a href=\"http://dbpedia.org/resource/American\">American</a> naval officer provided <a href=\"http://dbpedia.org/resource/Chilean\">Chilean</a> officials with information on two <a href=\"http://dbpedia.org/resource/Americans\">Americans</a>, which led to their 
executions as part of a coup that ousted <a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Salvador_Allende\">Salvador Allende</a>.<br/><br/><a href=\"http://dbpedia.org/resource/Mayor\">Mayor</a> <a href=\"http://dbpedia.org/resource/Rob_Ford\">Rob Ford</a> of <a href=\"http://dbpedia.org/resource/Toronto\">Toronto</a> returned to his job after undergoing drug and alcohol treatment, saying, \u201cMy top priority will be rebuilding trust.\u201d<br/><br/>The question is whether the new group, which now calls itself simply the <a href=\"http://dbpedia.org/resource/Islamic_State\">Islamic State</a>, will endure."
+        "A huge throng of people, mostly young, took to <a href=\"http://dbpedia.org/resource/Hong_Kong\">Hong Kong</a>\u2019s streets <a href=\"http://dbpedia.org/resource/Tuesday\">Tuesday</a>, defying <a href=\"http://dbpedia.org/resource/Beijing\">Beijing</a>\u2019s dwindling tolerance for challenges to its control.<br/><br/>The post, until now largely ceremonial, could become much more important under <a href=\"http://dbpedia.org/resource/Mr\">Mr</a>. <a href=\"http://dbpedia.org/resource/Erdogan\">Erdogan</a>, who has held power in <a href=\"http://dbpedia.org/resource/Turkey\">Turkey</a> for a decade.<br/><br/>The discovery of the teenagers\u2019 bodies in the <a href=\"http://dbpedia.org/resource/West_Bank\">West Bank</a> prompted vows of retaliation by <a href=\"http://dbpedia.org/resource/Israel\">Israel</a>, which blamed the <a href=\"http://dbpedia.org/resource/Palestinian\">Palestinian</a> group <a href=\"http://dbpedia.org/resource/Hamas\">Hamas</a> for the killings.<br/><br/><a href=\"http://dbpedia.org/resource/The_South\">The South</a> <a href=\"http://dbpedia.org/resource/African\">African</a> track star\u2019s agent and friend testified that the couple\u2019s relationship was strong and that he did not intend to kill her.<br/><br/>The <a href=\"http://dbpedia.org/resource/Japanese_prime_minister\">Japanese prime minister</a> announced that his government would reinterpret the antiwar <a href=\"http://dbpedia.org/resource/Constitution\">Constitution</a> to allow the armed forces to come to the aid of friendly nations.<br/><br/>The first clues that led to the grisly discovery of the bodies came only hours after their abduction in the <a href=\"http://dbpedia.org/resource/West_Bank\">West Bank</a> was reported.<br/><br/>The lawmakers were under pressure to name an inclusive government as insurgents mount a violent challenge north and west of <a href=\"http://dbpedia.org/resource/Baghdad\">Baghdad</a>.<br/><br/>The only viable political future for 
the country is federation. But <a href=\"http://dbpedia.org/resource/America\">America</a>\u2019s first priority is to see <a href=\"http://dbpedia.org/resource/ISIS\">ISIS</a> crushed.<br/><br/><a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Petro\">Petro</a> <a href=\"http://dbpedia.org/resource/O\">O</a>. <a href=\"http://dbpedia.org/resource/Poroshenko\">Poroshenko</a> said he would resume full-scale efforts to quash the pro-Russian uprising in eastern <a href=\"http://dbpedia.org/resource/Ukraine\">Ukraine</a>.<br/><br/><a href=\"http://dbpedia.org/resource/Nicolas_Sarkozy\">Nicolas Sarkozy</a>, the former <a href=\"http://dbpedia.org/resource/French_president\">French president</a>, has been under scrutiny for possible financial irregularities in his <a href=\"http://dbpedia.org/resource/2007\">2007</a> campaign and for other alleged offenses.<br/><br/><a href=\"http://dbpedia.org/resource/Myanmar\">Myanmar</a> is enjoying some new diplomatic clout, leading <a href=\"http://dbpedia.org/resource/China\">China</a> to court the country as <a href=\"http://dbpedia.org/resource/Beijing\">Beijing</a> presses its territorial claims in the <a href=\"http://dbpedia.org/resource/South_China_Sea\">South China Sea</a>.<br/><br/>The last remaining <a href=\"http://dbpedia.org/resource/African\">African</a> teams in the <a href=\"http://dbpedia.org/resource/World_Cup\">World Cup</a>, <a href=\"http://dbpedia.org/resource/Algeria\">Algeria</a> and <a href=\"http://dbpedia.org/resource/Nigeria\">Nigeria</a>, were eliminated on <a href=\"http://dbpedia.org/resource/Monday\">Monday</a>, ensuring that the continent would once again remember the <a href=\"http://dbpedia.org/resource/2014\">2014</a> event for off-the-field squabbles.<br/><br/>As <a href=\"http://dbpedia.org/resource/Hong_Kong\">Hong Kong</a> prepared for its annual pro-democracy march <a href=\"http://dbpedia.org/resource/Tuesday\">Tuesday</a>, a survey of 
residents found more discontent than ever with the <a href=\"http://dbpedia.org/resource/Chinese_government\">Chinese government</a>\u2019s policies toward the city, especially among the young.<br/><br/><a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Petro\">Petro</a> <a href=\"http://dbpedia.org/resource/O\">O</a>. <a href=\"http://dbpedia.org/resource/Poroshenko\">Poroshenko</a> ended a 10-day cease-fire, saying that rebels had not put down their weapons and had persisted in attacking government troops.<br/><br/>At least <a href=\"http://dbpedia.org/resource/22\">22</a> people were killed in the firefight \u2014 all of them assailants, the military said. One soldier was injured.<br/><br/>The giant <a href=\"http://dbpedia.org/resource/French\">French</a> bank admitted to transferring billions of dollars on behalf of <a href=\"http://dbpedia.org/resource/Sudan\">Sudan</a> and other countries the <a href=\"http://dbpedia.org/resource/United_States\">United States</a> has blacklisted.<br/><br/>The former chief justice of the <a href=\"http://dbpedia.org/resource/Constitutional_Court\">Constitutional Court</a> was sentenced to life in prison for corruption, the heaviest sentence ever for graft in one of the most corrupt countries in the world.<br/><br/>A former aide to former <a href=\"http://dbpedia.org/resource/Prime_Minister\">Prime Minister</a> <a href=\"http://dbpedia.org/resource/Petr_Necas\">Petr Necas</a> who later married him was found guilty of abuse of power on <a href=\"http://dbpedia.org/resource/Monday\">Monday</a> in a scandal that exposed their affair and toppled the government a year ago.<br/><br/>The court found that in <a href=\"http://dbpedia.org/resource/1973\">1973</a> an <a href=\"http://dbpedia.org/resource/American\">American</a> naval officer provided <a href=\"http://dbpedia.org/resource/Chilean\">Chilean</a> officials with information on two <a 
href=\"http://dbpedia.org/resource/Americans\">Americans</a>, which led to their executions as part of a coup that ousted <a href=\"http://dbpedia.org/resource/President\">President</a> <a href=\"http://dbpedia.org/resource/Salvador_Allende\">Salvador Allende</a>.<br/><br/><a href=\"http://dbpedia.org/resource/Mayor\">Mayor</a> <a href=\"http://dbpedia.org/resource/Rob_Ford\">Rob Ford</a> of <a href=\"http://dbpedia.org/resource/Toronto\">Toronto</a> returned to his job after undergoing drug and alcohol treatment, saying, \u201cMy top priority will be rebuilding trust.\u201d"
        ],
        "output_type": "pyout",
        "prompt_number": 46,
        "text": [
-        "<IPython.core.display.HTML at 0x7f706cc11910>"
+        "<IPython.core.display.HTML at 0x2cdd1d0>"
        ]
       }
      ],
--- a/notebooks/Record linkage with Nazca - Example Dbpedia - INSEE.ipynb	Tue Jul 01 14:43:49 2014 +0200
+++ b/notebooks/Record linkage with Nazca - Example Dbpedia - INSEE.ipynb	Tue Jul 01 15:10:34 2014 +0200
@@ -11,14 +11,20 @@
      "cell_type": "markdown",
      "metadata": {},
      "source": [
-      "<h2>Record linkage with Nazca - Example Dbpedia - INSEE</h2>"
-     ]
-    },
-    {
-     "cell_type": "markdown",
-     "metadata": {},
-     "source": [
-      "Imports"
+      "<h1>Record linkage with Nazca - Example Dbpedia - INSEE</h1>\n",
+      "\n",
+      "\n",
+      "This IPython notebook shows some features of the Python Nazca library :\n",
+      "<ul>\n",
+      "    <li> website : <a href=\"http://www.logilab.org/project/nazca\">http://www.logilab.org/project/nazca</a></li>\n",
+      "    <li> source : <a href=\"http://hg.logilab.org/review/nazca\">http://hg.logilab.org/review/nazca</a></li>\n",
+      "</ul>\n",
+      "<ul>\n",
+      "    <li> original notebook : <a href=\"http://hg.logilab.org/review/nazca/raw-file/cdc7992b78be/notebooks/Record%20linkage%20with%20Nazca%20-%20Example%20Dbpedia%20-%20INSEE.ipynb\">here !</a></li>\n",
+      "    <li> date: 2014-07-01</li>\n",
+      "    <li> author: Vincent Michel  (<i>vincent.michel@logilab.fr</i>, \n",
+      "                                  <i>vm.michel@gmail.com</i>) @HowIMetYourData</li>\n",
+      "</ul>"
      ]
     },
     {
@@ -34,7 +40,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 1
+     "prompt_number": 4
     },
     {
      "cell_type": "markdown",
@@ -72,7 +78,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 2
+     "prompt_number": 5
     },
     {
      "cell_type": "code",
@@ -93,7 +99,7 @@
        ]
       }
      ],
-     "prompt_number": 3
+     "prompt_number": 6
     },
     {
      "cell_type": "markdown",
@@ -115,7 +121,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 4
+     "prompt_number": 7
     },
     {
      "cell_type": "code",
@@ -136,7 +142,7 @@
        ]
       }
      ],
-     "prompt_number": 5
+     "prompt_number": 8
     },
     {
      "cell_type": "markdown",
@@ -161,7 +167,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 6
+     "prompt_number": 9
     },
     {
      "cell_type": "code",
@@ -182,7 +188,7 @@
        ]
       }
      ],
-     "prompt_number": 7
+     "prompt_number": 10
     },
     {
      "cell_type": "markdown",
@@ -207,7 +213,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 8
+     "prompt_number": 11
     },
     {
      "cell_type": "markdown",
@@ -239,7 +245,7 @@
        ]
       }
      ],
-     "prompt_number": 9
+     "prompt_number": 12
     },
     {
      "cell_type": "code",
@@ -258,7 +264,7 @@
        ]
       }
      ],
-     "prompt_number": 10
+     "prompt_number": 13
     },
     {
      "cell_type": "markdown",
@@ -290,7 +296,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 11
+     "prompt_number": 14
     },
     {
      "cell_type": "markdown",
@@ -318,7 +324,7 @@
        ]
       }
      ],
-     "prompt_number": 12
+     "prompt_number": 15
     },
     {
      "cell_type": "markdown",
@@ -336,7 +342,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 13
+     "prompt_number": 16
     },
     {
      "cell_type": "markdown",
@@ -362,7 +368,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 14
+     "prompt_number": 17
     },
     {
      "cell_type": "markdown",
@@ -382,7 +388,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 15
+     "prompt_number": 18
     },
     {
      "cell_type": "markdown",
@@ -400,7 +406,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 16
+     "prompt_number": 19
     },
     {
      "cell_type": "code",
@@ -426,7 +432,7 @@
        ]
       }
      ],
-     "prompt_number": 17
+     "prompt_number": 20
     },
     {
      "cell_type": "markdown",
@@ -463,7 +469,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 18
+     "prompt_number": 21
     },
     {
      "cell_type": "code",
@@ -474,7 +480,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 19
+     "prompt_number": 22
     },
     {
      "cell_type": "code",
@@ -500,7 +506,7 @@
        ]
       }
      ],
-     "prompt_number": 20
+     "prompt_number": 23
     }
    ],
    "metadata": {}
--- a/notebooks/Record linkage with Nazca - part 1 - Introduction.ipynb	Tue Jul 01 14:43:49 2014 +0200
+++ b/notebooks/Record linkage with Nazca - part 1 - Introduction.ipynb	Tue Jul 01 15:10:34 2014 +0200
@@ -11,7 +11,19 @@
      "cell_type": "markdown",
      "metadata": {},
      "source": [
-      "<h1>Record linkage with Nazca - part 1</h1>"
+      "<h1>Record linkage with Nazca - part 1</h1>\n",
+      "\n",
+      "This IPython notebook shows some features of the Python Nazca library :\n",
+      "<ul>\n",
+      "    <li> website : <a href=\"http://www.logilab.org/project/nazca\">http://www.logilab.org/project/nazca</a></li>\n",
+      "    <li> source : <a href=\"http://hg.logilab.org/review/nazca\">http://hg.logilab.org/review/nazca</a></li>\n",
+      "</ul>\n",
+      "<ul>\n",
+      "    <li> original notebook : <a href=\"http://hg.logilab.org/review/nazca/raw-file/cdc7992b78be/notebooks/Record%20linkage%20with%20Nazca%20-%20part%201%20-%20Introduction.ipynb\">here !</a></li>\n",
+      "    <li> date: 2014-07-01</li>\n",
+      "    <li> author: Vincent Michel  (<i>vincent.michel@logilab.fr</i>, \n",
+      "                                  <i>vm.michel@gmail.com</i>) @HowIMetYourData</li>\n",
+      "</ul>\n"
      ]
     },
     {
@@ -38,7 +50,7 @@
        "output_type": "pyout",
        "prompt_number": 1,
        "text": [
-        "<IPython.core.display.HTML at 0x7fd324037950>"
+        "<IPython.core.display.HTML at 0x7fb4bc036990>"
        ]
       }
      ],
@@ -150,11 +162,11 @@
        "output_type": "stream",
        "stream": "stdout",
        "text": [
-        "0.033762216568 (s)\n"
+        "0.0652930736542 (s)\n"
        ]
       }
      ],
-     "prompt_number": 10
+     "prompt_number": 3
     },
     {
      "cell_type": "markdown",
@@ -177,11 +189,11 @@
        "output_type": "stream",
        "stream": "stdout",
        "text": [
-        "3376.2216568 (s) = 0.937839349111 (h) = 0.0390766395463 (d)\n"
+        "6529.30736542 (s) = 1.81369649039 (h) = 0.0755706870997 (d)\n"
        ]
       }
      ],
-     "prompt_number": 11
+     "prompt_number": 4
     },
     {
      "cell_type": "markdown",
@@ -216,7 +228,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 12
+     "prompt_number": 5
     },
     {
      "cell_type": "code",
@@ -242,7 +254,7 @@
        ]
       }
      ],
-     "prompt_number": 13
+     "prompt_number": 6
     },
     {
      "cell_type": "markdown",
@@ -274,7 +286,7 @@
        ]
       }
      ],
-     "prompt_number": 14
+     "prompt_number": 7
     },
     {
      "cell_type": "code",
@@ -294,7 +306,7 @@
        ]
       }
      ],
-     "prompt_number": 15
+     "prompt_number": 8
     },
     {
      "cell_type": "markdown",
@@ -313,7 +325,7 @@
      "language": "python",
      "metadata": {},
      "outputs": [],
-     "prompt_number": 16
+     "prompt_number": 9
     },
     {
      "cell_type": "markdown",
@@ -345,7 +357,7 @@
        ]
       }
      ],
-     "prompt_number": 17
+     "prompt_number": 10
     },
     {
      "cell_type": "markdown",
@@ -382,7 +394,7 @@
        ]
       }
      ],
-     "prompt_number": 18
+     "prompt_number": 11
     },
     {
      "cell_type": "markdown",
@@ -414,7 +426,7 @@
        ]
       }
      ],
-     "prompt_number": 19
+     "prompt_number": 12
     },
     {
      "cell_type": "markdown",
@@ -453,7 +465,7 @@
        ]
       }
      ],
-     "prompt_number": 20
+     "prompt_number": 13
     },
     {
      "cell_type": "markdown",
@@ -477,14 +489,14 @@
      "outputs": [
       {
        "ename": "SyntaxError",
-       "evalue": "invalid syntax (<ipython-input-21-f01d54be2f60>, line 1)",
+       "evalue": "invalid syntax (<ipython-input-14-f01d54be2f60>, line 1)",
        "output_type": "pyerr",
        "traceback": [
-        "\u001b[0;36m  File \u001b[0;32m\"<ipython-input-21-f01d54be2f60>\"\u001b[0;36m, line \u001b[0;32m1\u001b[0m\n\u001b[0;31m    for sa, sb in (('abcd', 'abcd'), ('abcd', 'abce'), ('abcd', 'abc'), ('abc', 'abcd'), ('abcd', 'efgh'),,\u001b[0m\n\u001b[0m                                                                                                          ^\u001b[0m\n\u001b[0;31mSyntaxError\u001b[0m\u001b[0;31m:\u001b[0m invalid syntax\n"
+        "\u001b[0;36m  File \u001b[0;32m\"<ipython-input-14-f01d54be2f60>\"\u001b[0;36m, line \u001b[0;32m1\u001b[0m\n\u001b[0;31m    for sa, sb in (('abcd', 'abcd'), ('abcd', 'abce'), ('abcd', 'abc'), ('abc', 'abcd'), ('abcd', 'efgh'),,\u001b[0m\n\u001b[0m                                                                                                          ^\u001b[0m\n\u001b[0;31mSyntaxError\u001b[0m\u001b[0;31m:\u001b[0m invalid syntax\n"
        ]
       }
      ],
-     "prompt_number": 21
+     "prompt_number": 14
     },
     {
      "cell_type": "markdown",
@@ -518,7 +530,7 @@
        ]
       }
      ],
-     "prompt_number": 22
+     "prompt_number": 15
     },
     {
      "cell_type": "markdown",
@@ -544,7 +556,7 @@
        ]
       }
      ],
-     "prompt_number": 23
+     "prompt_number": 16
     },
     {
      "cell_type": "code",
@@ -563,7 +575,7 @@
        ]
       }
      ],
-     "prompt_number": 24
+     "prompt_number": 17
     },
     {
      "cell_type": "markdown",
@@ -590,7 +602,7 @@
        ]
       }
      ],
-     "prompt_number": 25
+     "prompt_number": 18
     },
     {
      "cell_type": "markdown",
@@ -621,7 +633,7 @@
        ]
       }
      ],
-     "prompt_number": 26
+     "prompt_number": 19
     },
     {
      "cell_type": "code",
@@ -640,7 +652,7 @@
        ]
       }
      ],
-     "prompt_number": 27
+     "prompt_number": 20
     },
     {
      "cell_type": "markdown",
@@ -722,7 +734,7 @@
        ]
       }
      ],
-     "prompt_number": 28
+     "prompt_number": 21
     }
    ],
    "metadata": {}
--- a/notebooks/Record linkage with Nazca - part 2 - Normalization and blockings.ipynb	Tue Jul 01 14:43:49 2014 +0200
+++ b/notebooks/Record linkage with Nazca - part 2 - Normalization and blockings.ipynb	Tue Jul 01 15:10:34 2014 +0200
@@ -11,7 +11,19 @@
      "cell_type": "markdown",
      "metadata": {},
      "source": [
-      "<h1>Record linkage with Nazca - part 2 - Normalization and blockings</h1>"
+      "<h1>Record linkage with Nazca - part 2 - Normalization and blockings</h1>\n",
+      "\n",
+      "This IPython notebook shows some features of the Python Nazca library :\n",
+      "<ul>\n",
+      "    <li> website : <a href=\"http://www.logilab.org/project/nazca\">http://www.logilab.org/project/nazca</a></li>\n",
+      "    <li> source : <a href=\"http://hg.logilab.org/review/nazca\">http://hg.logilab.org/review/nazca</a></li>\n",
+      "</ul>\n",
+      "<ul>\n",
+      "    <li> original notebook : <a href=\"http://hg.logilab.org/review/nazca/raw-file/cdc7992b78be/notebooks/Record%20linkage%20with%20Nazca%20-%20part%202%20-%20Normalization%20and%20blockings.ipynb\">here !</a></li>\n",
+      "    <li> date: 2014-07-01</li>\n",
+      "    <li> author: Vincent Michel  (<i>vincent.michel@logilab.fr</i>, \n",
+      "                                  <i>vm.michel@gmail.com</i>) @HowIMetYourData</li>\n",
+      "</ul>"
      ]
     },
     {
--- a/notebooks/Record linkage with Nazca - part 3 - Putting it all together.ipynb	Tue Jul 01 14:43:49 2014 +0200
+++ b/notebooks/Record linkage with Nazca - part 3 - Putting it all together.ipynb	Tue Jul 01 15:10:34 2014 +0200
@@ -11,7 +11,19 @@
      "cell_type": "markdown",
      "metadata": {},
      "source": [
-      "<h1>Record linkage with Nazca - part 3 - Putting it all together</h1>"
+      "<h1>Record linkage with Nazca - part 3 - Putting it all together</h1>\n",
+      "\n",
+      "This IPython notebook shows some features of the Python Nazca library :\n",
+      "<ul>\n",
+      "    <li> website : <a href=\"http://www.logilab.org/project/nazca\">http://www.logilab.org/project/nazca</a></li>\n",
+      "    <li> source : <a href=\"http://hg.logilab.org/review/nazca\">http://hg.logilab.org/review/nazca</a></li>\n",
+      "</ul>\n",
+      "<ul>\n",
+      "    <li> original notebook : <a href=\"http://hg.logilab.org/review/nazca/raw-file/cdc7992b78be/notebooks/Record%20linkage%20with%20Nazca%20-%20part%203%20-%20Putting%20it%20all%20together.ipynb\">here !</a></li>\n",
+      "    <li> date: 2014-07-01</li>\n",
+      "    <li> author: Vincent Michel  (<i>vincent.michel@logilab.fr</i>, \n",
+      "                                  <i>vm.michel@gmail.com</i>) @HowIMetYourData</li>\n",
+      "</ul>"
      ]
     },
     {