013 Best Big Data Analytics Tools of 2019: Uses and Limitations

Author: 胡巴Lei特 | Published 2019-07-27 21:52

    013 10 Best Big Data Analytics Tools for 2019 – With Uses & Limitations

    1. Best Big Data Analytics Tools

    This post surveys ten widely used data analytics tools: Tableau Public, OpenRefine, KNIME, RapidMiner, Google Fusion Tables, NodeXL, Wolfram Alpha, Google Search Operators, Solver, and Dataiku DSS. For each one it gives a short description, its main uses, and its limitations.

    2. List of Big Data Analytics Tools

    Data analytics is the process of examining datasets to draw conclusions from the information they contain. It is widely used in business to make better-informed decisions, and by scientists and researchers to verify theories, models, and hypotheses.

    Below are the ten tools, with the uses and limitations of each. Let's go through them one by one:

    • Tableau Public
    • OpenRefine
    • KNIME
    • RapidMiner
    • Google Fusion Tables
    • NodeXL
    • Wolfram Alpha
    • Google Search Operators
    • Solver
    • Dataiku DSS

    a. Tableau Public

    i. What is Tableau Public – Big Data Analytics Tools

    Tableau Public is a simple, intuitive tool that offers intriguing insights through data visualization. It has a million-row limit. Because it is easy to use, it fares better than most of the other players in the data analytics market.

    With Tableau's visuals you can investigate a hypothesis, explore the data, and cross-check your insights.

    ii. Uses of Tableau Public

    • You can publish interactive data visualizations to the web for free.

    • No programming skills are required.

    Visualizations published to Tableau Public can be embedded in blogs and web pages and shared through email or social media. The shared content can also be made available for download. This makes it one of the best big data analytics tools.

    iii. Limitations of Tableau Public

    • All data is public, with very little scope for restricting access.

    • Data size is limited.

    • It cannot be connected to R.

    • The only ways to read data are OData sources, Excel, or txt files.

    Translator's note: my company uses the paid version of Tableau. It is not very cost-effective, though it may suit large corporate groups.

    b. OpenRefine

    i. What is OpenRefine – Data Analytic Tools

    Formerly known as Google Refine, OpenRefine is data cleaning software that helps you clean up data for analysis. It operates on rows of data, with cells organized under columns, much like relational database tables.

    ii. Uses of OpenRefine

    • Cleaning messy data

    • Transforming data

    • Parsing data from websites

    • Adding data to a dataset by fetching it from web services. For instance, OpenRefine can be used to geocode addresses into geographic coordinates.
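To make "cleaning messy data" concrete, here is a minimal Python sketch of one common clean-up technique: grouping spellings that collapse to the same normalized key. It is similar in spirit to the fingerprint-style clustering OpenRefine offers, but it is not OpenRefine's code. The function names and the sample values are hypothetical.

```python
import string
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Return a normalized key: trimmed, lowercased, punctuation removed,
    tokens de-duplicated and sorted alphabetically."""
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def cluster_values(values):
    """Group raw values that share the same fingerprint key."""
    clusters = defaultdict(list)
    for value in values:
        clusters[fingerprint(value)].append(value)
    # Keep only keys that have more than one distinct spelling to review.
    return {key: group for key, group in clusters.items() if len(set(group)) > 1}


# Hypothetical messy column values.
raw = ["Acme Corp.", "acme corp", "  Corp Acme ", "Globex",
       "Initech, Inc.", "initech inc"]
for key, members in cluster_values(raw).items():
    print(key, "->", members)
```

Each printed group is a candidate for merging into a single canonical value. Whether the spellings really mean the same thing is still a human decision.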

    iii. Limitations of OpenRefine

    • OpenRefine is unsuitable for large datasets and does not work very well with big data.

    c. KNIME

    i. What is KNIME – Data Analysis Tools

    KNIME helps you manipulate, analyze, and model data through visual programming. It is used to integrate various components for data mining and machine learning.

    ii. Uses of KNIME

    • You don't write blocks of code. Instead, you drag and drop connection points between activities.

    • This data analysis tool also supports programming languages. In fact, it can be extended to handle chemistry data and text mining, and to run Python and R.

    iii. Limitation of KNIME

    • Poor data visualization

    d. RapidMiner

    i. What is RapidMiner – Data Analytic Tools

    RapidMiner provides machine learning and data mining procedures, including data visualization, processing, statistical modeling, and predictive analytics.

    Written in Java, RapidMiner is fast gaining acceptance as a big data analytics tool.

    ii. Uses of RapidMiner

    • It provides an integrated environment for business analytics and predictive analysis.

    • Along with commercial and business applications, it is also used for application development.

    iii. Limitations of RapidMiner

    • RapidMiner has size constraints on the number of rows.

    • RapidMiner needs more hardware resources than ODM and SAS.

    e. Google Fusion Tables

    i. What is Google Fusion Tables

    Google Fusion Tables is a cooler, larger version of Google Spreadsheets: an incredible tool for data analysis, mapping, and large dataset visualization. It also belongs on any list of business analytics tools, and it is one of the best big data analytics tools.

    ii. Uses of Google Fusion Tables

    • Visualize bigger table data online.

    • Filter and summarize across hundreds of thousands of rows.

    • Combine tables with other data on the web.

    • Merge two or three tables to generate a single visualization that includes the combined sets of data.

    • Create a map in minutes.
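Merging tables before visualizing them comes down to joining rows on a shared column. The sketch below shows the idea in plain Python. It does not use any Google API, and the function name, the column names, and the sample rows are all made up for illustration.

```python
def merge_tables(left, right, key):
    """Inner-join two lists of dict rows on a shared key column."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    merged = []
    for row in left:
        # Rows without a partner in the other table are left out.
        for match in index.get(row[key], []):
            merged.append({**row, **match})
    return merged


# Hypothetical tables that share a "store" column.
sales = [
    {"store": "north", "units": 120},
    {"store": "south", "units": 75},
]
locations = [
    {"store": "north", "lat": 10.0, "lon": 20.0},
    {"store": "east", "lat": 11.0, "lon": 21.0},
]
print(merge_tables(sales, locations, "store"))
```

Only the "north" store appears in both tables, so the merged result holds a single row carrying both the sales figure and the coordinates, which is the shape a map layer would need.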

    iii. Limitations of Google Fusion Tables

    • Only the first 100,000 rows of data in a table are included in query results or mapped.

    • The total size of the data sent in one API call cannot be more than 1 MB.
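Limits like these are usually handled on the client side by capping the row count and splitting uploads into batches. The following Python sketch shows one way that could look. It makes no real API calls. The function name is hypothetical, the payload is assumed to be JSON, and "1 MB" is taken to mean 1,000,000 bytes, which is an assumption.

```python
import json

MAX_ROWS = 100_000      # row cap quoted in the text
MAX_BYTES = 1_000_000   # "1 MB" per call, assumed here to mean 10**6 bytes


def batch_rows(rows, max_bytes=MAX_BYTES, max_rows=MAX_ROWS):
    """Yield lists of rows whose JSON encoding stays within max_bytes.

    `rows` must be a list. Rows beyond max_rows are ignored, mirroring
    the row cap described above.
    """
    batch, size = [], 2  # 2 bytes for the enclosing "[]"
    for row in rows[:max_rows]:
        encoded = len(json.dumps(row).encode("utf-8"))
        if encoded + 2 > max_bytes:
            raise ValueError("a single row exceeds the payload limit")
        # json.dumps separates list items with ", " (2 bytes).
        extra = encoded + (2 if batch else 0)
        if batch and size + extra > max_bytes:
            yield batch
            batch, size = [], 2
            extra = encoded
        batch.append(row)
        size += extra
    if batch:
        yield batch


# Hypothetical rows and a small limit so the splitting is visible.
rows = [{"id": i, "name": "x" * 50} for i in range(1000)]
batches = list(batch_rows(rows, max_bytes=10_000))
print(len(batches), "batches,", sum(len(b) for b in batches), "rows")
```

Each batch can then be sent as one request body. A real client would also need to account for any envelope the API wraps around the rows.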

    f. NodeXL

    i. What is NodeXL

    NodeXL is visualization and analysis software for relationships and networks, and it provides exact calculations. The basic edition (not the Pro one) is free and open source. NodeXL is one of the best statistical tools for data analysis. It includes advanced network metrics, access to social media network data importers, and automation.

    ii. Uses of NodeXL

    This is one of the data analysis tools in Excel, and it helps in the following areas:

    • Data Import

    • Graph Visualization

    • Graph Analysis

    • Data Representation

    This software integrates into Microsoft Excel 2007, 2010, 2013, and 2016. It opens as a workbook with a variety of worksheets containing the elements of a graph structure, such as nodes and edges.

    It can import various graph formats, such as adjacency matrices, Pajek .net, UCINet .dl, GraphML, and edge lists.
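Edge lists and adjacency matrices are two views of the same graph. As a rough illustration, and independent of NodeXL itself, this Python sketch converts a small edge list into an adjacency matrix. The function name and the sample edges are hypothetical.

```python
def edge_list_to_adjacency(edges, directed=False):
    """Convert (source, target) pairs into (node order, adjacency matrix)."""
    nodes = sorted({node for edge in edges for node in edge})
    position = {node: i for i, node in enumerate(nodes)}
    matrix = [[0] * len(nodes) for _ in nodes]
    for source, target in edges:
        matrix[position[source]][position[target]] = 1
        if not directed:
            # Undirected graphs are symmetric across the diagonal.
            matrix[position[target]][position[source]] = 1
    return nodes, matrix


# Hypothetical four-person network.
edges = [("ann", "bob"), ("bob", "cat"), ("ann", "cat"), ("cat", "dan")]
nodes, matrix = edge_list_to_adjacency(edges)
print(nodes)
for row in matrix:
    print(row)
```

The edge list is compact for sparse networks, while the matrix makes questions such as "are these two nodes connected?" a single lookup.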

    iii. Limitations of NodeXL

    • You need to use multiple seeding terms for a particular problem.

    • You need to run the data extractions at slightly different times.

    g. Wolfram Alpha

    i. What is Wolfram Alpha

    Wolfram Alpha is a computational knowledge engine, or answer engine, founded by Stephen Wolfram.

    ii. Uses of Wolfram Alpha

    • It is an add-on for Apple's Siri.

    • It provides detailed responses to technical searches and solves calculus problems.

    • It helps business users with information charts and graphs, and with creating topic overviews, commodity information, and high-level pricing history.

    iii. Limitations of Wolfram Alpha

    • Wolfram Alpha can only deal with publicly known numbers and facts, not with viewpoints.

    • It limits the computation time for each query.

    Any questions about these statistical tools for data analysis? Please comment.

    h. Google Search Operators

    i. What is Google Search Operators

    Google search operators are a powerful resource for filtering Google results, so that you instantly get the most relevant and useful information.

    ii. Uses of Google Search Operators

    • Faster filtering of Google search results

    • Google's powerful data analysis tool can help discover new information.
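As an illustration of how operators narrow a search, the Python sketch below assembles a query from a few well-known operators: a quoted exact phrase, `site:`, `filetype:`, and a leading `-` to exclude a word. The helper name and the example values are hypothetical, and the code only builds a string. It does not contact Google.

```python
from urllib.parse import urlencode


def build_query(terms, site=None, filetype=None, exact=None, exclude=()):
    """Combine plain terms with common search operators into one query."""
    parts = list(terms)
    if exact:
        parts.append(f'"{exact}"')          # exact-phrase match
    if site:
        parts.append(f"site:{site}")        # restrict to one domain
    if filetype:
        parts.append(f"filetype:{filetype}")  # restrict to one file type
    parts.extend(f"-{word}" for word in exclude)  # drop unwanted words
    return " ".join(parts)


# Hypothetical search: PDF annual reports on one site, skipping drafts.
query = build_query(["data", "analytics"], site="example.org",
                    filetype="pdf", exact="annual report", exclude=["draft"])
print(query)
print("https://www.google.com/search?" + urlencode({"q": query}))
```

Typing the printed query straight into the search box has the same effect. The URL form is only convenient when searches are generated from a script or a spreadsheet.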

    i. Solver

    i. What is Excel Solver

    The Solver Add-in is a Microsoft Office Excel add-in program that is available when you install Microsoft Excel or Office. It is a linear programming and optimization tool in Excel.

    It allows you to set constraints, and it is an advanced optimization tool that helps in quick problem-solving.

    ii. Uses of Solver

    • The final values found by Solver are a solution to the interrelated decisions in the model.

    • It uses a variety of methods to find solutions, from nonlinear optimization and linear programming to evolutionary and genetic algorithms.
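To show what a small linear programming problem with constraints looks like, here is a Python sketch that solves a two-variable example by brute force: it checks every corner where two constraint boundaries cross and keeps the best feasible one. This is not the algorithm Solver uses. It is only a compact way to see an objective, constraints, and an optimum together. The function name and the numbers are illustrative assumptions.

```python
from itertools import combinations


def solve_lp_2d(objective, constraints, tol=1e-9):
    """Maximize objective[0]*x + objective[1]*y subject to a*x + b*y <= c.

    Tries every intersection of two constraint boundaries and keeps the
    best feasible point. Only suitable for tiny, bounded two-variable
    problems.
    """
    best_point, best_value = None, float("-inf")
    for (a1, b1, c1), (a2, b2, c2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if abs(det) < tol:  # parallel boundaries never intersect
            continue
        # Cramer's rule for the 2x2 system of boundary equations.
        x = (c1 * b2 - c2 * b1) / det
        y = (a1 * c2 - a2 * c1) / det
        if all(a * x + b * y <= c + tol for a, b, c in constraints):
            value = objective[0] * x + objective[1] * y
            if value > best_value:
                best_point, best_value = (x, y), value
    return best_point, best_value


# Illustrative problem: maximize 3x + 5y under three resource limits.
constraints = [
    (1, 0, 4),    # x <= 4
    (0, 2, 12),   # 2y <= 12
    (3, 2, 18),   # 3x + 2y <= 18
    (-1, 0, 0),   # x >= 0
    (0, -1, 0),   # y >= 0
]
point, value = solve_lp_2d((3, 5), constraints)
print(point, value)
```

For this example the best corner is x = 2, y = 6, with an objective value of 36. In Excel the same model would be set up as an objective cell, two variable cells, and a list of constraints in the Solver dialog.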

    iii. Limitations of Solver

    • Poor scaling is one of the areas where Excel Solver falls short.

    • It can affect solution time and quality.

    • Solver affects the intrinsic solvability of your model.

    j. Dataiku DSS

    i. What is Dataiku DSS

    Dataiku DSS is a collaborative data science software platform. It helps a team explore, prototype, build, and deliver its own data products more efficiently.

    ii. Uses of Dataiku DSS

    Dataiku DSS provides an interactive visual interface in which users can point, click, and build, or use languages like SQL.

    iii. Limitation of Dataiku DSS

    • Limited visualization capabilities

    • UI hurdles: reloading of code/datasets

    • Inability to easily compile entire code into a single document/notebook

    • Still needs to integrate with Spark

    These were the top data analytics tools, and that concludes the list of best big data analytics tools.

    3. Conclusion: Big Data Analytics tools

    In this post we studied big data analytics tools: Tableau Public, OpenRefine, KNIME, RapidMiner, Google Fusion Tables, NodeXL, Wolfram Alpha, Google Search Operators, Solver, and Dataiku DSS, covering the description, uses, and limitations of each.

    I hope this blog on analytics tools helps you understand data analytics tools, a booming topic nowadays. If you have any questions about big data analytics tools, feel free to ask in the comment section.

    Best Data Analysis Software Systems For 2019

    https://data-flair.training/blogs/best-big-data-analytics-tools


        Article link: https://www.haomeiwen.com/subject/ypdyrctx.html