Find Jobs
Hire Freelancers

Data Transformation Pipeline in Jupyter Notebook

$30-250 USD

已完成
已发布超过 6 年前

$30-250 USD

货到付款
Develop a python script (can't be cmd driven) that enables me to quickly cleanse and transform datasets of varying sizes for use in other analytics systems. Using a Jupyter Notebook I want to import complex datasets and wrangle them for use in virtually any target system. Key capabilities include: - Import from flat file - Locate and remove or modify missing or mismatched data - Unnest complex data structures - Identify statistical outliers in your data for review and management - Perform lookups from one dataset into another reference dataset - Aggregate columnar data using a variety of aggregation functions - Merge datasets with joins - Append one dataset to another through union operations This is not intended to be a web app of any kind. There is really no front-end to speak of... I simply want to be able to interact with the Jupyter Notebook to pull all this off. In general, the flow is as follows: 1. Import data: Integrate data from a variety of sources of data. 2. Profile our data: Before, during, and after we transform our data, we can use the visual profiling tools to quickly analyze and make decisions about your data. 3. Build transform recipes: Use the various views in the Transformers to build our transform recipes and preview the results on sampled data. 4. Generate Results: Launch a task to run our recipe on the full dataset. Review results and iterate as needed. 5. Export results: Export the generated results data for use outside of the script running in Jupyter Notebook. Walking through the above, you will have noticed that we imported, cleansed, transformed, and possibly enhanced our data for use in the next step of our analytics pipeline. Here are the greater details of what we are expecting as part of this solution: We expect that most of the functions contained within Pandas will suffice for what we need. However, each column within in an imported Pandas dataframe needs to have all the below available to be applied to it should a user decide to select it: ^^^Please See Uploaded Document for More Details^^^
项目 ID: 15348575

关于此项目

5提案
远程项目
活跃7 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
A proposal has not yet been provided
$166 USD 在7天之内
0.0 (0条评论)
0.0
0.0
5威客以平均价$185 USD来参与此工作竞价
用户头像
Hello! Any manipulations done in Jupyter Notebooks are part of my day job as a bioinformatics analyst. Relevant Skills and Experience Python, Jupyter, Data processing Proposed Milestones $294 USD - All
$294 USD 在5天之内
5.0 (11条评论)
4.5
4.5
用户头像
I have a good experience on working with Advanced R and Python. I have quite a good knowledge of Deep learning and ML Algorithm , have also developed dashboards and Shiny Web Application. Relevant Skills and Experience I understand the project requirement and will deliver the desired product within the time specified. Proposed Milestones $155 USD - milestone
$155 USD 在3天之内
4.5 (10条评论)
4.1
4.1
用户头像
A proposal has not yet been provided
$177 USD 在7天之内
5.0 (3条评论)
2.7
2.7
用户头像
I am python expert with data analytics. Relevant Skills and Experience Python, Jupyter notebbok, Excel, Data Processing Proposed Milestones $133 USD - full task
$133 USD 在2天之内
0.0 (0条评论)
0.0
0.0

关于客户

UNITED STATES的国旗
Franklin, United States
5.0
9
付款方式已验证
会员自4月 17, 2010起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。