Data Science or not Data Science? : Différence entre versions
De BIGDATA
Ligne 17 : | Ligne 17 : | ||
'''Apache Spark''' | '''Apache Spark''' | ||
− | + | [[Apache Spark]] | |
'''Grid5000''' | '''Grid5000''' |
Version du 29 janvier 2016 à 14:39
Welcome on LIPN Wiki about Big Data
With more and more data, we need to use the right technologies in order be able to analyze large amount of data.
Big Data is often caracterize by the 4 V for Volume, Variety, Velocity, Veracity.
To extract knowledge from data we use machine learning, it's a family of algorithms which transforms data into model or description in order to predict or categorize data.
We use also analytics tools which consist to presents informations in a more readable way.
Tools we use
Apache Spark : http://spark.apache.org/ Apache Flink : https://flink.apache.org/ TenserFlow : https://www.tensorflow.org/ Grid5000 : https://www.grid5000.fr/mediawiki/index.php/Grid5000:Home Wendelin : http://www.nexedi.com/NXD-Document.Blog.Wendelin.Release.0.4.alpha
Apache Spark Apache Spark
Grid5000
MediaWiki a été installé avec succès.
Consultez le Guide de l’utilisateur pour plus d’informations sur l’utilisation de ce logiciel de wiki.