La "data factory"

Sid Ahmed Benabderrahmane
Post-doctoral fellow in computer science

Training

  • Doctor of Philosophy (PhD) in Computer Science, Nancy University, France. Title: Using domain knowledge for mining transcriptomic data: semantic similarity, functional classification and fuzzy profiles.
  • Master of Science (MSc), in Computer Science, Evry University, France.

Publications (journal papers) / Awards

  • Smart4Job: A Big Data Framework for Intelligent Job Offers Broadcasting Using Time Series Forecasting and Semantic Classification. Journal of Big Data Research. http://dx.doi.org/10.1016/j.bdr.2016.11.001, 2016.
  • On the Use of Local Patterns and Big-Data Dimensionality Reduction Methods for Facial Poses Classification. International Journal of Computers JCP. 13(1), 2016.
  • Extending Space Filling Curves and Symbolic Time Series Representation for Facial Pose Classification. International Journal of Pattern Recognition and Artificial Intelligence, 2016.
  • Functional classification of genes using semantic distance and fuzzy clustering approach: evaluation with reference sets and overlap analysis. International Journal of Computational Biology and Drug Design, 2012;5(3-4):245-60.
  • IntelliGO: a new vector-based semantic similarity measure including annotation origin. BMC Bioinformatics, 11(1):588, 2010.
  • More information on Google Scholar

Research Fields

Data Mining, Machine Learning, Big Data, Knowledge Discovery.

Methodologies and tools

Methodologies:

  • Data transformation, management and analysis using machine learning and data mining methods for knowledge discovery on structured, semi-structured and unstructured data.
  • Supervised and unsupervised machine learning methods: classification, clustering, forecasting, itemsets, pattern recognition.

Tools:

  • C++, R, Java, Python, Web, Databases, Big Data Frameworks (Hadoop, Spark).

Julien Brault

Post-doctoral fellow in economics

Training

PhD, International Studies, Graduate Institute, Geneva. Thesis: The International Transactions of France in the 20th Century

Publications / Awards

  • Job market paper: French Oil Protectionism and the International Political Economy of Rent Seeking
  • Second PhD paper: The Political Economy of French Foreign Exchange Control
  • ECB working paper: Deleveraging in the NFC sector: Some Evidence from Firm-level Data
  • Other publications:
    • Oil Roads and Oil Rent, in Oil Roads (dir. A. Beltran), 2014
    • Oil and the Imbalances of French Development, French Economic History Review, 2015
    • Technip from 1958 to 2008, the Emergence of a National Champion of Oil Engineering, Entreprise et histoire, 2013

Research Fields

International Economic Relations, Political Economy, Rent Seeking, Balances of Payments, Trade Restrictions, Private Debt, Energy

Methodologies and tools

Julien specializes in historical data and analysis. He has a strong command of historical data sources, both those available online and those drawn from archives, and a good knowledge of international economic archives. He can handle questions related to extracting archival data and to reconciling and merging long-run data series. Julien has a good knowledge of Stata and is proficient in Tableau.

Bruno Chaves
Coordinator of the project

A dual background in computer science and economics / econometrics

  • CNRS Engineer in Information Technology and project manager for 10 years (Nanterre University, Ecole Normale Supérieure and Dauphine University).
  • Previously: Research and teaching assistant in econometrics / statistics (Sorbonne University)

Coordinator for several international research projects:

  • SIOE - Society for Institutional & Organizational Economics (coordinator since 2007)
  • IOEA - Institutional & Organizational Economics Academy (co-coordinator since 2002)
  • DIME - Dynamics of Institutions and Markets in Europe (IT manager 2005-2011)

Methodologies and tools

  • Project management, object-oriented and test-driven development, continuous integration methodologies.
  • Full stack web development (JavaScript, PHP, CSS, HTML5, Git, SQL databases, responsive web, etc.). Working experience with Java, Python and R.

Nada Mimouni
Post-doctoral fellow in computer science

Training

  • PhD, Université Paris 13, LIPN, RCLN team. Title: Querying a semantic network of documents: intertextuality in legal information access
  • Research engineer, Technische Universität Darmstadt, Germany, UKP: NLP & Semantic IR for Question Answering
  • Master in computer science, INRIA-LORIA, ORPAILLEUR

Publications

  • A Conceptual Approach for Relational IR: Application to Legal Collections. International conference on Formal Concept Analysis - ICFCA 2015
  • Search and discovery in legal document networks. International Conference on Legal Knowledge and Information Systems - JURIX 2015
  • Une ontologie documentaire pour l'accès aux contenus juridiques. 26es Journées francophones d'Ingénierie des Connaissances - IC 2015
  • An Approach for Searching and Browsing a Network of Legal Documents, Law Science Technology Review, 2014
  • Towards Graph-based and Semantic Search in Legal Information Access Systems, JURIX 2014

Research Fields

Document network analysis, Intertextuality, Information Retrieval, Semantic web, Ontology design

Methodologies

  • Knowledge engineering
  • Semantic web, Ontology design, Semantic technologies
  • Data linking
  • Application domains: Legal texts, Relational querying, Governance analytics

Main programming languages and tools

  • Java, R, Python, RDF/RDFS, OWL, SPARQL, TopBraid Composer, Protégé

Timothy Yeung
Post-doctoral fellow in economics

Training

PhD (Toulouse School of Economics, 2015), Thesis: Essays on Political Economy

Publications / Awards

  • Political philosophy, executive constraint and electoral rules, Journal of Comparative Economics, 2016 (forthcoming). http://dx.doi.org/10.1016/j.jce.2016.10.006
  • Other working papers:
    • Understanding Airbnb in 14 European Cities, coauthored with Diane Coyle
    • Do Majoritarian Rules Favour Larger Industries in the Economy, coauthored with Izaskun Zuazu
    • A Cheap-talk Model with Multiple Free-riding Audiences

Research Fields

Political Economy, Public Economics, Industrial Organization, Applied Microeconometrics

Methodologies and tools

  • Econometrics: hypothesis testing, identification of causality and program evaluation
  • Types of Data: Cross-sectional, short and long panels, survival analysis
  • Statistical software: Stata, SAS