ECON 128 Assignment Help, Machine Learning Assignment Writing

Author: wkra464 | Published: 2019-03-21 14:44

ECON 128 Machine Learning Final Project

Background

In order to eliminate poverty, it is imperative to be able to identify households suffering from poverty and target them with assistance. However, the identification of households in poverty relies on data from consumption surveys that are difficult, expensive, and time-consuming to collect. Therefore, recent efforts have focused on the use of "rapid surveys" that rely on a limited number of poverty identifiers that serve as effective proxies for the calculation of a household's poverty status.

Objective

The World Bank has asked you to identify the most important variables that determine a household's poverty status, to help them reduce the cost associated with compiling data to predict poverty.

Data

The data provided for analysis are household responses to a World Bank consumption survey. Each observation has a unique household id to reflect the survey responses of that distinct household. Further, each household is labeled in or out of poverty through the Poor indicator variable. A sample of the data is as follows:

[Figure: sample of the data]

Notice that all of the variables are encoded as random character strings but reflect actual survey questions. Categorical variables may reflect questions such as whether your household has items such as Bar soap, Cooking oil, Matches, and Salt. Numeric questions often ask things like "How many working cell phones in total does your household own?" or "How many separate rooms do the members of your household occupy?" The project is not meant for you to determine the real meaning of the variables you select, but rather to identify the variables, in their encoded state, that best predict poverty.

Two datasets in the format pictured above are supplied, one for model training and one for model testing. No external data beyond what is provided should be used for modeling.

Error Metric

When evaluating your model's ability to predict a household's poverty status, you should use the logloss error metric. We define the logloss metric through the following formula (the standard binary form):

$$\text{logloss} = -\frac{1}{N}\sum_{i=1}^{N}\left[y_i \log(p_i) + (1 - y_i)\log(1 - p_i)\right]$$

where N is the number of households, y_i is 1 if household i is poor and 0 otherwise, and p_i is the predicted probability that household i is poor.

The logloss metric takes any value from 0 to positive infinity, and a model scoring 0 is a perfect classifier. Also, notice how the logloss error function operates. The metric rewards a model that confidently classifies a household correctly and punishes a model that is overconfident in wrong classifications. For example, a model that predicts a high probability of a household being poor when the household is actually poor will receive a lower logloss score than a model that predicts a high probability of poverty for a household that is not poor.

Deliverables

Assuming that the World Bank has contracted you for this project, compile a report that communicates your problem-solving approach and works through the aspects of a data science project. Essentially, your report should contain the following elements:

1) Problem Description
Demonstrate your understanding of what the World Bank wants you to accomplish and give an overview of your solution plan.

2) Description of the data used for analysis
Review some summary statistics of your data. For example, what is the distribution of poor households in your data set?
Upon reviewing the data, are there any problems that may present themselves for modeling? For example, if the distribution of poor households is heavily skewed, how may this affect the model? Similarly, are there certain variables with missing data? Do we need to impute data?
Describe any data cleaning performed. For example, were any new features or transformations of the data created? If missing data was found, how was it imputed?

3) Methods
Describe any models you are using with a brief explanation of how they work. If the chosen models involve hyperparameters, explain what the hyperparameters are and how you plan to select them.
Focus on why you selected your models. Are there any advantages that you feel your model/approach has over other models/approaches? Any disadvantages?

4) Code
The execution of the proposed methods in code. Ensure that there are sufficient comments such that someone unfamiliar with your code can understand what you are doing.

5) Results and Conclusion
How well did your model perform on the test data?
What variables should the World Bank focus on to effectively predict poverty?
Justify any performance versus error metric tradeoff for a model with a sub-selection of variables. Is the sacrifice in model performance, according to your chosen error metric, worth it for the reduced-variable model?

This project has been adapted from a competition on the data science website DrivenData.
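The behavior of the logloss metric described under Error Metric can be illustrated with a short sketch. This is not part of the original project brief: the function name `log_loss`, the `eps` clipping parameter, and the example probabilities are all assumptions chosen for illustration, and the formula used is the standard binary logloss.

```python
import math


def log_loss(y_true, p_pred, eps=1e-15):
    """Mean binary logloss (illustrative sketch, not the brief's own code).

    y_true: list of 0/1 labels (1 = poor, 0 = not poor).
    p_pred: list of predicted probabilities that each household is poor.
    eps:    hypothetical clipping value that keeps log() away from 0.
    """
    if len(y_true) != len(p_pred):
        raise ValueError("y_true and p_pred must have the same length")
    if not y_true:
        raise ValueError("at least one observation is required")

    total = 0.0
    for y, p in zip(y_true, p_pred):
        # Clip the probability so that log(0) is never evaluated.
        p = min(max(p, eps), 1.0 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return -total / len(y_true)


# A confident, correct prediction: the household is poor and p = 0.9.
confident_right = log_loss([1], [0.9])   # about 0.105

# The same confident prediction for a household that is not poor.
confident_wrong = log_loss([0], [0.9])   # about 2.303

print(confident_right, confident_wrong)
```

The two calls mirror the example in the text: the same predicted probability of 0.9 yields a much lower score when the household is actually poor than when it is not.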

Article link: https://www.haomeiwen.com/subject/zjnqvqtx.html
