{"id":3189,"date":"2022-03-02T21:31:45","date_gmt":"2022-03-02T21:31:45","guid":{"rendered":"https:\/\/akyalab.com\/?p=3189"},"modified":"2022-03-02T21:31:47","modified_gmt":"2022-03-02T21:31:47","slug":"tools-to-master-data-science","status":"publish","type":"post","link":"https:\/\/akyalab.com\/fr\/tools-to-master-data-science\/","title":{"rendered":"Tools to Master Data Science"},"content":{"rendered":"<h3 class=\"wp-block-heading\">Introduction:<\/h3>\n\n\n\n<p>Do you remember that data science combines mathematics, statistics, and programming skills? In this article, we will see which tools we need to master data science. Be warned, though, that these are not the only tools, and you can use others as you see fit.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">1. Python<\/h3>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"100\" height=\"100\" src=\"http:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/python-e1646255859322.png\" alt=\"\" class=\"wp-image-3190\"\/><\/figure>\n\n\n\n<p>Python is an interpreted, high-level programming language. It is versatile and is used mostly for data science and software development. Python has gained popularity thanks to its ease of use and code readability.<\/p>\n\n\n\n<p>As a result, Python is widely used for data analysis, natural language processing, and computer vision. Its ecosystem includes graphical and statistical packages such as Matplotlib, NumPy, and SciPy, as well as more advanced packages for deep learning such as TensorFlow, PyTorch, and Keras.<\/p>\n\n\n\n<p>We use Python for data mining, data wrangling, visualization, and developing predictive models, which makes it a very flexible programming language.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">2. 
R<\/h3>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"100\" height=\"100\" src=\"http:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/r-e1646255988375.png\" alt=\"\" class=\"wp-image-3191\"\/><\/figure>\n\n\n\n<p><strong>R<\/strong> is a scripting language tailored specifically for statistical computing. It is widely used for data analysis, statistical modeling, time-series forecasting, clustering, and other statistical operations.<\/p>\n\n\n\n<p>It also has the features of an object-oriented programming language. R is an interpreted language and is popular across many industries.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">3. SQL<\/h3>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"100\" height=\"100\" src=\"http:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/serveur-sql-e1646256130723.png\" alt=\"\" class=\"wp-image-3192\"\/><\/figure>\n\n\n\n<p>SQL stands for Structured Query Language. Data scientists use SQL to manage and query data stored in databases. Being able to extract information from a database is the first step towards analyzing the data. A relational database is a collection of data organized in tables.<\/p>\n\n\n\n<p>We use SQL to extract, manage, and manipulate the data. For example, a data scientist working in the banking industry might use SQL to extract customer information. While relational databases use SQL, \u2018NoSQL\u2019 is a popular choice for non-relational or distributed databases.<\/p>\n\n\n\n<p>Recently, NoSQL has been gaining popularity due to its flexible scalability, its dynamic schema design, and the open-source nature of many NoSQL systems. MongoDB, Redis, and Cassandra are some of the popular NoSQL databases.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">4. 
Hadoop<\/h3>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"266\" src=\"https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo-1024x266.png\" alt=\"\" class=\"wp-image-3193\" srcset=\"https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo-1024x266.png 1024w, https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo-300x78.png 300w, https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo-768x199.png 768w, https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo-18x5.png 18w, https:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/1280px-Hadoop_logo.png 1280w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Big data is another trending term; it refers to the management and storage of huge amounts of data. Data is either structured or unstructured. A data scientist must be familiar with complex data and must know the tools that manage the storage of massive datasets.<\/p>\n\n\n\n<p>One such tool is Hadoop. It is open-source software that stores data in a distributed file system (HDFS) and processes it with a programming model called \u2018MapReduce\u2019. Several projects build on Hadoop, such as Apache Pig, Hive, and HBase.<\/p>\n\n\n\n<p>Thanks to its ability to process very large datasets, its scalable architecture, and its low-cost deployment, <a href=\"https:\/\/data-flair.training\/blogs\/hadoop-tutorial\/\"><strong>Hadoop has grown to become one of the most popular tools for Big Data<\/strong><\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">5. Tableau<\/h3>\n\n\n\n<p>Tableau is data visualization software that specializes in graphical analysis of data. 
It lets users create interactive visualizations and dashboards.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"576\" src=\"http:\/\/akyalab.com\/wp-content\/uploads\/2022\/03\/Tableau-Emblem-1024x576.png\" alt=\"\" class=\"wp-image-3194\"\/><\/figure>\n\n\n\n<p>This makes Tableau a good choice for showing trends and insights in the data through interactive charts such as treemaps, histograms, and box plots. An important feature of Tableau is its ability to connect to spreadsheets, relational databases, and cloud platforms.<\/p>\n\n\n\n<p>This lets Tableau work with the data directly, which makes analysis easier for users.<\/p>","protected":false},"excerpt":{"rendered":"<p>Introduction: Do you remember that data science combines mathematics, statistics, and programming skills? In this article, we will see which tools we need to master data science. Be warned, though, that these are not the only tools, and you can use others as you see fit. 
1.PythonPython [\u2026]<\/p>","protected":false},"author":3,"featured_media":3195,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/posts\/3189"}],"collection":[{"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/comments?post=3189"}],"version-history":[{"count":1,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/posts\/3189\/revisions"}],"predecessor-version":[{"id":3196,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/posts\/3189\/revisions\/3196"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/media\/3195"}],"wp:attachment":[{"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/media?parent=3189"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/categories?post=3189"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/akyalab.com\/fr\/wp-json\/wp\/v2\/tags?post=3189"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}