{"id":241,"date":"2018-02-28T11:55:32","date_gmt":"2018-02-28T10:55:32","guid":{"rendered":"http:\/\/wp.unil.ch\/llist\/?post_type=tribe_events&#038;p=241"},"modified":"2018-02-28T12:17:31","modified_gmt":"2018-02-28T11:17:31","slug":"cours-block-2017","status":"publish","type":"tribe_events","link":"https:\/\/wp.unil.ch\/llist\/event\/cours-block-2017\/","title":{"rendered":"Cours-bloc 2017"},"content":{"rendered":"<h1>Statistique textuelle et topic models<\/h1>\n<p style=\"text-align: justify\">Le cours-bloc, en fran\u00e7ais, porte sur la m\u00e9thodologie et la pratique de l&rsquo;analyse de donn\u00e9es textuelles, avec les logiciels libres <a href=\"https:\/\/textable.io\" target=\"_BLANK\">Textable<\/a>, <a href=\"https:\/\/www.iramuteq.org\/\" target=\"_BLANK\">Iramuteq<\/a> et <a href=\"https:\/\/www.r-project.org\/\" target=\"_BLANK\">R<\/a>. Cet atelier de deux jours pleins s\u2019adresse en priorit\u00e9 aux doctorant.e.s et chercheur.e.s de la Facult\u00e9 des lettres, actuellement (ou prochainement) confront\u00e9s aux donn\u00e9es textuelles dans le cadre de leur recherche. Aucun pr\u00e9requis particulier en informatique ou statistique n&rsquo;est exig\u00e9 (au-del\u00e0 de l&rsquo;utilisation basique d&rsquo;un ordinateur).<\/p>\n<h6><strong>Programme:<\/strong><\/h6>\n<table border=\"0\" cellspacing=\"0\">\n<tbody>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><b><span style=\"color: #000000\">mer. 8<br \/>\n<\/span><\/b><b><span style=\"color: #000000\">nov. 17<br \/>\n<\/span><\/b><\/td>\n<td align=\"left\" valign=\"middle\"><b><span style=\"color: #000000\">matin: salle ANT- 2012; apr\u00e8s midi: salle ANT-5183<\/span><\/b><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">09h15-09h30<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Accueil des participants, pr\u00e9sentation du programme et des intervenants<br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">09h30-10h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Bases conceptuelles, annotations, import XML et construction de matrices documents-termes avec Textable (<a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/intro_adt_ax_2017_11_08.pdf\" target=\"_BLANK\">slides<\/a>, <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/tp1_textable.pdf\" target=\"_BLANK\">TP<\/a>)<\/span><span style=\"color: #000000\"><br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">10h45-11h15<br \/>\n<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause de 30 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">11h15-12h45 <\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Expressions r\u00e9guli\u00e8res pour l&rsquo;extraction de donn\u00e9es semi-structur\u00e9es avec Textable (<a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/tp2_textable.pdf\" target=\"_BLANK\">TP<\/a>)<br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">12h45-13h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause repas de 60 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">13h45-15h15 <\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Analyse factorielle des correspondances (AFC) et classification non supervis\u00e9e (clustering) avec Iramuteq, partie I (<a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/11\/matrice_tdm_viz_clust_Iramuteq.pdf\" target=\"_BLANK\">slides<\/a>)<br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">15h15-15h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause de 30 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">15h45-17h00<br \/>\n<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Iramuteq, partie II<\/span><\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><b><span style=\"color: #000000\">jeudi 9<br \/>\n<\/span><\/b><b><span style=\"color: #000000\">nov.<\/span><\/b><b><span style=\"color: #000000\"> 17<\/span><\/b><\/td>\n<td align=\"left\" valign=\"middle\"><b><span style=\"color: #000000\">matin et apr\u00e8s-midi : salle ANT-5183<\/span><\/b><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">09h15-10h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Introduction \u00e0 <\/span><span style=\"color: #000000\">R: quelques principes g\u00e9n\u00e9raux<br \/>\nR pour le text mining (le package tm). (<a href=\"https:\/\/github.com\/yrochat\/stat_textuelle\">notebook<\/a>)<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">10h45-11h15<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause de 30 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">11h15-12h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Exemples et applications de text mining dans R.<br \/>\nBases n\u00e9cessaires pour le cours de l&rsquo;apr\u00e8s-midi. (<a href=\"https:\/\/github.com\/yrochat\/stat_textuelle\">notebook<\/a>)<br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">12h45-13h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause repas de 60 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">13h45-15h15<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Th\u00e9orie, exemples et applications du LDA et topic modelling (<a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/topicmodels.pdf\">slides<\/a>)<br \/>\n<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"21\"><span style=\"color: #000000\">15h15-15h45<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">pause de 30 minutes<\/span><\/td>\n<\/tr>\n<tr>\n<td align=\"left\" valign=\"middle\" height=\"43\"><span style=\"color: #000000\">15h45-17h00<br \/>\n<\/span><\/td>\n<td align=\"left\" valign=\"middle\"><span style=\"color: #000000\">Travail pratique topicmodels() et interpr\u00e9tation (<a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/topicmodels.zip\">topicmodels.zip<\/a>)<br \/>\n<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h6>Donn\u00e9es:<\/h6>\n<p>texte brut: <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/moliere_avare_utf8-1.txt\">moliere_avare.txt<\/a><\/p>\n<p>XML: <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/inaugural_speeches_tagged.xml_-1.zip\">inaugural_speeches_tagged.xml<\/a><\/p>\n<p>csv: <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/P3_GrantExport_with_abstracts_2012.csv\">P3_GrantExport_with_abstracts_2012<\/a> (bas\u00e9 sur <a href=\"https:\/\/p3.snf.ch\" target=\"_blank\" rel=\"noopener\">https:\/\/p3.snf.ch\/P3Export\/P3_GrantExport_with_abstracts.csv<\/a>)<\/p>\n<p>pour IRaMuTeQ: <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/textes_partis_reduit.txt\">textes_partis_reduit.txt<\/a>, <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/textes_partis_tdm.csv\">textes_partis_tdm.csv<\/a> , <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/Trois_Romans_Zola.txt\">Trois_Romans_Zola.txt<\/a> , <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/10\/Trois_Romans_Zola_tdm.csv\">Trois_Romans_Zola_tdm.csv<\/a>, <a href=\"https:\/\/wp.unil.ch\/llist\/files\/2017\/11\/fns_fr_depuis_2007.txt\">fns_fr_depuis_2007.txt<\/a><\/p>\n<p>pour R (jeudi matin): <a href=\"https:\/\/github.com\/yrochat\/stat_textuelle\">cliquer ici<\/a> puis sur le bouton vert \u00ab\u00a0Clone or download\u00a0\u00bb puis \u00ab\u00a0Download ZIP\u00a0\u00bb puis ouvrir le \u00ab\u00a0.Rmd\u00a0\u00bb dans RStudio ou le \u00ab\u00a0.nb.html\u00a0\u00bb dans un navigateur.<\/p>\n<h6>Logiciels:<\/h6>\n<p>Les logiciels seront tous disponibles sur les ordinateurs des salles de cours.<br \/>\nIls sont tous sous licence open source et peuvent \u00eatre install\u00e9s sur vos machines personnelles en suivant les instructions ci-dessous:<\/p>\n<p><a href=\"https:\/\/textable.io\" target=\"_blank\" rel=\"noopener\">Textable<\/a>: instructions d&rsquo;installation: <a href=\"https:\/\/textable.io\/get-started\/\" target=\"_blank\" rel=\"noopener\">textable.io\/get-started<\/a><a href=\"https:\/\/cran.r-project.org\/\" target=\"_blank\" rel=\"noopener\"><br \/>\nR<\/a>: lien de t\u00e9l\u00e9chargement: <a href=\"https:\/\/cran.r-project.org\/banner.shtml\" target=\"_blank\" rel=\"noopener\">cran.r-project.org\/banner.shtml<\/a> (pour les mac: installez <a href=\"https:\/\/www.xquartz.org\/\">Xquarz<\/a>)<br \/>\n<a href=\"https:\/\/www.rstudio.com\">Rstudio<\/a>: lien de t\u00e9l\u00e9chargement <a href=\"https:\/\/www.rstudio.com\/products\/RStudio\">www.rstudio.com\/products\/RStudio<\/a><br \/>\n<a href=\"https:\/\/iramuteq.org\" target=\"_blank\" rel=\"noopener\">Iramuteq<\/a>: lien de t\u00e9l\u00e9chargement: <a href=\"https:\/\/sourceforge.net\/projects\/iramuteq\/\" target=\"_blank\" rel=\"noopener\">sourceforge.net\/projects\/iramuteq<\/a><\/p>\n<p>Installer des modules sur R (apr\u00e8s avoir install\u00e9 R lui-m\u00eame):<br \/>\n1. Ouvrez une \u00ab\u00a0invite de commandes\u00a0\u00bb (Windows: cmd.exe, Linux\/Mac: Terminal)<br \/>\n2. Tapez: <code>R?<\/code><br \/>\n3. Utilisez le code suivant dans R:<\/p>\n<pre>install.packages(c(\"FactoMineR\", \"ca\", \"tm\", \"topicmodels\"), repos='https:\/\/stat.ethz.ch\/CRAN\/')<\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Statistique textuelle et topic models Le cours-bloc, en fran\u00e7ais, porte sur la m\u00e9thodologie et la pratique de l&rsquo;analyse de donn\u00e9es textuelles, avec les logiciels libres Textable, Iramuteq et R. Cet &hellip; <\/p>\n","protected":false},"author":1001641,"featured_media":0,"template":"","meta":{"_seopress_robots_primary_cat":"","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","_tribe_events_status":"","_tribe_events_status_reason":"","footnotes":""},"tags":[],"tribe_events_cat":[4],"class_list":["post-241","tribe_events","type-tribe_events","status-publish","tribe_events_cat-cours-bloc","cat_cours-bloc"],"_links":{"self":[{"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tribe_events\/241","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tribe_events"}],"about":[{"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/types\/tribe_events"}],"author":[{"embeddable":true,"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/users\/1001641"}],"version-history":[{"count":2,"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tribe_events\/241\/revisions"}],"predecessor-version":[{"id":245,"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tribe_events\/241\/revisions\/245"}],"wp:attachment":[{"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/media?parent=241"}],"wp:term":[{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tags?post=241"},{"taxonomy":"tribe_events_cat","embeddable":true,"href":"https:\/\/wp.unil.ch\/llist\/wp-json\/wp\/v2\/tribe_events_cat?post=241"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}