015 Beware! Income Tax Department is Using Big Data Analytics to Track Expenditure Patterns
The applications of big data are gaining a strong foothold in every industry. Recently, the Government of India has adopted Big Data in its fight against tax evaders. This will make tax evasion more difficult. The programme – “Project Insight”, costing 1,000 crores has been in development for the past three years. Tax evasion was always difficult to monitor and manage given the traditional tools. However, with several countries like USA and UK already using big data for tax management, India has also joined this league. In this article, we will see how Big Data is helping to catch tax evaders.
大数据的应用在各个行业都获得了强劲的立足点. 最近,印度政府在打击逃税者的斗争中采用了大数据. 这将使逃税变得更加困难. 耗资 1,000 卢比的 “项目洞察” 计划在过去三年里一直在开发中. 鉴于传统的工具,逃税总是很难监控和管理. 然而,随着美国和英国等几个国家已经将大数据用于税收管理,印度也加入了这个联盟. 在这篇文章中,我们将看到大数据是如何帮助逃税者的.
Income Tax Department Will Use Big Data to Catch Tax Evaders
所得税部门将利用大数据抓逃税者
In India problems like failure to pay tax dues, submission of false tax returns, inaccuracy in financial statements, lack of income reports, storing wealth in foreign countries is becoming an increasing trend. According to Finance Minister Arun Jaitley, 10 million Indians were exempt out of 37 million Indians who filed for tax returns. This leaves just 27 million to pay taxes. This accounts for a little more than 2% of the entire country. This is an abysmally low percentage of tax-payers in India.
在印度,不缴纳税款、提交虚假纳税申报表、财务报表不准确、缺乏收入报告、在外国储存财富等问题正成为一种日益增长的趋势.根据 财政部长 Arun Jaitley 在 1000万名申请纳税申报的印度人中,有 3700万名印度人被免税.这只剩下 2700万的税款.这个数字占全国的 2% 多一点. 这是印度纳税人的比例非常低.
Big Data in income tax departmentSource – News18
Many times, people purchase products or indulge in a lavish lifestyle beyond their monetary means as per their tax returns. This does not often come up on the radar of tax officials as it is not possible to analyze all of the information on a manual level. However, with the emergence of Big Data and Data Analytics technologies, it is now possible to analyze all of the large scale information.
很多时候,根据纳税申报表,人们购买产品或享受超出他们货币手段的奢华生活方式.这并不经常出现在税务官员的雷达上,因为不可能在人工水平上分析所有信息.然而,随着互联网的出现,* 大数据与数据分析技术*, 现在可以分析所有的大规模信息.
The 2016 operation to curb the usage of black money provided a key revelation to the Income Tax Department to upgrade their tax collection procedures and utilize an efficient methodology to track tax evaders. This led to the inception of* Project Insight. India is now one of the several countries that use Big Data to track tax evasion. In the United States, IRS has been using Big Data on its phone surveillance records, tracking social media accounts and using extensive data mining to develop analytical algorithms for identification of tax compliance issues. Similarly, the UK Government had implemented big data analytics* in their income tax segment and observed an increase of $ 5.4 billion in tax return revenue.
2016 遏制黑钱使用的行动为所得税部门升级税收程序和利用有效的方法跟踪逃税者提供了一个关键启示. 这导致了 项目洞察. 印度现在是利用大数据追踪逃税的几个国家之一. 在美国,国税局一直在电话监控记录上使用大数据,跟踪社交媒体账户,并广泛使用 数据挖掘开发识别税务合规问题的分析算法. 同样的, 英国政府实施大数据分析 在他们的所得税部分 纳税申报收入增加了 54亿美元.
With a rapid expansion in data, it is becoming increasingly difficult to analyze all of it through traditional methodologies. Moreover, a country of the huge population like India cannot only rely on traditional methods of survey. Many of the analytical problems could not be solved before due to high expenses. Also, the analytical platforms could not handle such large volumes of data. Moreover, most of the data collected were not present in the electronic form. All these problems made it difficult to manage and analyze large amounts of data. Therefore, we need Big Data to analyze an astronomical amount of data that is generating every day.
随着数据的快速扩展,通过传统方法分析所有数据变得越来越困难. 此外,像印度这样人口众多的国家不能只依靠传统的调查方法. 由于费用高昂,许多分析问题以前无法解决. 此外,分析平台无法处理如此大量的数据. 此外,收集到的大部分数据都没有以电子形式出现. 这些问题的存在使得对海量数据的管理和分析变得非常困难. 因此,我们需要大数据来分析每天产生的天文数字.
In the light of the above issues and the limitations in the current technology, the Income Tax Department has introduced Big Data Analytics to monitor and track the social networking profiles of people and find patterns of inconsistencies between their income and spending. Therefore, the Income Tax Department has launched Project Insight for this purpose.
鉴于上述问题和当前技术的局限性, 所得税部门引入了大数据分析,以监控和跟踪人们的社交网络状况,并发现他们的收入和支出之间不一致的模式. 因此,所得税部门为此目的推出了项目洞察.
What Exactly is Project Insight?
Project Insight 到底是什么?
Project Insight is a combination of Big Data Mangement System and Machine Learning. It is a composite analytical platform that will be able to store and manage the income and expenditure of citizens as well as possess analytical capabilities to analyze the patterns of inconsistencies within these two features.
项目洞察是大数据管理系统和 机器学习.它是一个综合分析平台,能够存储和管理公民的收入和支出,并具备分析这两个特征中不一致的模式的分析能力.
Project Insight’s big data analytics system will actively track the social media of the citizens. It will search for exorbitant purchases and track people indulging in lavish lifestyle which does not comply with their income and tax returns. This means that based on the activities on social media and the purchasing history, IT departments can analyze and find out the discrepancies in the documented earning and actual spendings. Moreover, the data gathered will include the addresses of taxpayers and their IT return profiles. All these attributes will ensure real-time monitoring of the citizens. There is also a business intelligence platform that includes large scale data warehousing and analysis of data. It will help the income tax officials to track high-value transactions and inhibit the flow of black money.
项目洞察的大数据分析 系统会主动 追踪市民的社交媒体 .它将寻找过高的购买,并跟踪那些不符合收入和纳税申报的奢侈生活方式的人.这意味着,IT 部门可以根据社交媒体上的活动和采购历史,分析并找出记录在案的收入和实际支出的差异.此外,收集的数据将包括纳税人的地址及其 IT 回报档案.所有这些属性将确保对公民的实时监控.还有一个 商业智能包括大规模数据仓库和数据分析的平台.这将有助于所得税官员跟踪高价值交易,并抑制黑钱的流动.
The government allied itself with L&T Infotech to implement this project. It will help to combine the databases of IT returns, TDS & TCS statements, IT forms and forms from financial institutions of the country. Also, as part of Project Insight, a Compliance Management Centralised Processing Centre will handle verification, management of campaign and the generation and follow-up of letters and notices.
政府与之结盟 L & T 信息技术 实施这个项目.它将有助于整合来自该国金融机构的 It 回报、 TDS & TCS 报表、 IT 表单和表单数据库.此外,作为项目洞察的一部分,合规管理集中处理中心将处理活动的验证、管理以及信函和通知的生成和跟进.
The Income Tax Department has introduced a 360 Degree vision profiling that allows them to identify non-filers of tax through social media monitoring.
所得税部门出台了 360 度视野 通过社交媒体监控,他们可以识别非申报人.
Now is the Time to be Aware of the Term – Big Data
现在是了解大数据这个术语的时候了
As concluded by the cases mentioned above, big data has revolutionized all walks of life. It is being extensively used fields like finance, health, manufacturing, and transportation. With the massive explosion in data, there is a huge requirement for people who are specialized in handling data. We generate about 2.5 quintillion bytes of data every day. It has become the new oil. This form of data is mostly unstructured and cannot be managed with the traditional RDBMS. Big Data provides all the tools and means necessary to extract store and analyze to gain insights for help companies make careful data-driven decisions.
通过上述案例,大数据已经彻底改变了各行各业.在金融、卫生、制造、交通等领域得到了广泛的应用.随着数据的大规模爆炸,对专门处理数据的人提出了巨大的要求.我们产生关于 每天 2.5 字节的数据. 它已经成为新的石油.这种形式的数据大多是非结构化的,不能用传统的 关系数据库.大数据提供了提取存储和分析的所有必要工具和手段,以获得帮助公司做出谨慎的数据驱动决策的见解.
Start Learning Big Data With Industry Experts
As mentioned above, most of the data generated by us is present in the unstructured format and we need specialized Big Data Engineers and Data Scientists to handle such a huge volume of unstructured information. From the above-discussed case scenario of analyzing tax returns, Big Data is also used to improve customer experiences, predicting sales, analyzing genomic sequences, customer churn analysis, etc. With Big Data, companies are able to explore new areas that were previously undiscovered, given the lack of technology and tools. With the effective collection of data and its analysis, organizations gain deep insights about their products, clients, their performance and their competitors.
正如上面提到的,我们生成的大部分数据都是以非结构化的格式存在的,我们需要专门的大数据工程师处理如此庞大的非结构化信息的数据科学家.从上述分析纳税申报表的案例场景来看,大数据还被用于改善客户体验、预测销售、分析基因组序列、客户流失分析等.有了大数据,由于缺乏技术和工具,公司能够探索以前未被发现的新领域.通过有效的数据收集和分析,组织可以深入了解他们的产品、客户、绩效和竞争对手.
The opportunities associated with Big Data are as vast as data itself. With the huge expanse in data, there is also a huge increase in the requirement for people who are specialized in handling data. Many people are looking to transform their careers into that of Big Data. It is seen as the key to landing a big job in the industry.
的大数据带来的机遇和数据本身一样巨大.随着数据的巨大膨胀,对专门处理数据的人的需求也有了巨大的增长.许多人都希望将自己的职业生涯转变为大数据的职业生涯.这被认为是在该行业找到一份大工作的关键.
The current Big Data pool is shallow. As a result, more and more companies are looking forward to hiring Big Data Specialists and are offering them lucrative salaries. Also, if you have the knowledge of big data and its tools, you can also work as a freelancer.
目前的大数据池比较浅.因此,越来越多的公司期待着雇佣大数据专家,并为他们提供丰厚的薪水.此外,如果你对大数据及其工具有所了解,你也可以作为自由职业者工作.
What Skills Will You Require to Master Big Data?
掌握大数据需要哪些技能?
In order to** learn Big Data**, there is no essential background skill that is required. However, in order to better grasp Big Data, you must have knowledge of a programming language, preferably Java. You must be familiar with the basics of programming. For Big Data, the three programming languages you should also know are – Java, Scala and Python. While knowledge of Python and Scala is not mandatory, it will be beneficial for getting in-depth insight into Big Data. People with the knowledge of Java will find the learning curve of Scala to be much easier as Scala is a Java proprietary language.
为了学习大数据,没有必要的基本背景技能.但是要更好的掌握大数据,必须要有编程语言的知识,最好是 Java 的知识.你必须熟悉编程的基础知识.对于大数据,你应该知道的三种编程语言是 Java 、 Scala 和 Python.虽然 Python 和 Scala 的知识不是必须的,但深入了解大数据将是有益的.掌握 Java 知识的人会发现,由于 Scala 是一种 Java 专有语言,Scala 的学习曲线要容易得多.
While Scala is primarily a functional programming language, it is useful while working with** Apache Spark which is a popular Big Data tool**. Apart from this, you must have a working knowledge of Python, which is increasingly becoming the most popular choice of programming for Data Analytics. Not to mention, most of the Big Data tools support Python as their programming language.
虽然 Scala 主要是一种函数式编程语言,但它在使用流行的大数据工具 Apache Spark.除此之外,你必须对 Python 有一定的工作知识,Python 正日益成为数据分析编程的最受欢迎的选择.更不用说,大部分大数据工具都支持 Python 作为编程语言.
Other than this, all you need to learn Big Data is a curiosity to learn and explore the concepts of Big Data technology.
除此之外,学习大数据所需要的只是对学习和探索大数据技术概念的好奇心.
Popular Big Data Tools You Must Know
你必须知道的流行大数据工具
In this section, we will look at two of the popular big data tools. These technologies are – Apache Hadoop and Apache Spark. These technologies have made Big Data a possibility and a skill that can be acquired by anyone, owing to their open-source nature.
在本节中,我们将介绍两种流行的大数据工具.这些技术是 Apache Hadoop 和 Apache Spark.这些技术使得大数据成为一种可能,并且由于其开源特性,任何人都可以获得这种技术.
Apache Hadoop – Apache Hadoop is an open-source big data platform. It is written in Java Programming language but offers support in languages like C++, Ruby, Python and Java. This technology is the most sought after in industries. Hadoop is renowned for its distributed database system called Hadoop Distributed File System. It consists of MapReduce which is a powerful framework responsible for processing large volumes of data. Hadoop is also popular for its data locality concept which increases the efficiency of Hadoop applications. Due to this reason, it is widely preferred by industries and is a must know tool for Big Data enthusiasts.
Apache Hadoop-Apache Hadoop 是一个开源的大数据平台.它是用 Java 编程语言编写的,但在 C + + 、 Ruby 、 Python 和 Java 等语言中提供支持.这项技术在工业中是最受欢迎的.Hadoop 以其分布式数据库系统 Hadoop 分布式文件系统而闻名.MapReduce 是一个强大的框架,负责处理大量的数据.Hadoop 的数据局部性概念也很受欢迎,它提高了 Hadoop 应用程序的效率.由于这个原因,它被行业广泛青睐,是大数据爱好者必须了解的工具.
Learn Hadoop With Industry Experts
Apache Spark – Apache Spark is another big data tool which is popular for its lightning fast computational capabilities. It is an evolution of Apache Hadoop that allows high-speed streaming of data and quick access to SQL workloads with machine learning capabilities. As a matter of fact, Spark can run 100 times faster than MapReduce. Due to this reason, it is increasingly becoming popular among industries who are looking for lightning fast big data platforms.
Apache Spark-Apache Spark 是另一个以其快速计算能力而广受欢迎的大数据工具.这是 Apache Hadoop 的发展,它允许数据高速流动,并通过机器学习功能快速访问 SQL 工作负载.事实上,Spark 的运行速度比 MapReduce 快 100 倍.正因为如此,它在寻找闪电般的快速大数据平台的行业中越来越受欢迎.
We saw how Big Data has revolutionized other industries and how Big Data professionals are in huge demand. You could be the next Big Data Expert who is going to catch tax evaders, start your Big Data learning now.
我们看到大数据给其他行业带来了革命性的变化,也看到了大数据专业人士的巨大需求.你可能会成为下一个抓住逃税者的大数据专家,立即开始大数据学习.
网友评论