Innocent Musanzikwa
Verified Expert in Engineering
Data Engineer and Developer
Inno is a seasoned data engineer and developer who has worked at IRI, a top retail data analytics company, in Africa and North America for the past decade and as a freelance consultant for the past couple of years. As an SQL and ETL developer, he has created quality data warehouses using industry-standard techniques like Kimball and Data Vault. As a data engineer, Inno has built highly robust and scalable data pipelines both on-premises and in the cloud using several cutting-edge technologies.
Preferred Environment
SQL, PySpark, Python, Hadoop, Apache Hive, Azure Synapse, Oracle, SQL Server Integration Services (SSIS), Azure Data Factory, Data Warehousing
The most amazing...
...big data warehousing and data integration solution I've designed, using Python, SQL, ADF, Hadoop, Hive, and Spark, won an RFP in Canada against six competitors.
Work Experience
Data Engineer
Darwill, Inc.
- Built Tableau dashboards and visualizations using AWS Redshift and Aurora databases.
- Created AWS Lambda functions running Python for custom ETL tasks and ad-hoc requests.
- Managed AWS Redshift and Aurora databases and designed data warehouses and data migrations.
- Redesigned the client's data warehouse using the AWS tech stack and improved their migration process by introducing federated queries and Lambda functions running Python pipelines, as well as completely revamping their Tableau dashboards.
Data Engineer
SFL Scientific
- Consulted on an existing, poorly designed SSIS data integration project and helped identify bottlenecks and inefficiencies.
- Redesigned the existing data pipeline using SSIS to be efficient and scalable.
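- Performed SQL tuning and SQL code reviews to improve process efficiency.
BI and Data Warehouse Specialist
Aviation Holdings LLC
- Designed and developed data pipelines to integrate data from the QuickBooks API, the Sage Intacct API, and spreadsheets into Azure SQL.
- Designed and developed a data warehouse in Azure SQL.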
- Designed and created business reports and KPI dashboards using Power BI.
- Developed complex SQL scripts to manage data transformations and speed up integration.
Data Analyst for a Migration Project
JLL - JLLT Data
- Developed the data pipeline to integrate data from Salesforce to Microsoft SQL.
- Designed advanced SQL code, e.g., CTEs, stored procedures, and functions, to manage data transformations.
- Performed SQL tuning to improve ETL efficiencies and process scalability.
- Consulted on standard operating procedures and best practices.
Director | Data Engineering
IRI
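- Developed Azure Data Factory pipelines to integrate data from Apache Hive, HDFS, OAuth 2 APIs, and various flat-file types into Azure SQL.
- Managed onshore and offshore big data development teams, assigning tasks and tracking progress in Jira.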
- Oversaw data strategy and recommendations for new data sources and ongoing projects.
- Mentored big data engineers and helped them improve their skills.
- Architected new data models and upgraded old data warehouses as per client request or technology change.
ETL Architect
IRI
- Developed SQL-based data warehouses on-premises and in the cloud.
- Integrated various data sources, from flat files to cloud-based sources like Snowflake, AWS, and data lakes, into Azure data warehouses and Apache Hive on Hadoop.
- Created scalable data pipelines and improved efficiencies on the existing ones.
- Trained and upskilled new data developers and participated in code reviews.
- Maintained system documentation of all business data components and strategies.
Lead SQL Developer
IRI
- Developed SQL-based data warehouses and data marts.
- Wrote SQL queries to feed SSRS reports.
- Used SSIS, Talend, and DataStage for ETL processes depending on the client's requirements.
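- Created custom business reports using SQL Server Reporting Services (SSRS).
- Managed junior developers and ran development stand-up meetings.
SQL/ETL Developer and Consultant
Mi9 Retail (formerly JustEnough Software)
- Managed SQL replication between mobile devices and SQL Server.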
- Created SQL data warehouses using the Kimball methodology for reporting purposes.
- Designed and developed ETL packages using SQL Server Integration Services (SSIS).
- Designed and developed reports in SQL Server Reporting Services (SSRS).
- Performed database tuning and code reviews for any code being deployed to production.
Experience
Data Migration from Azure SQL to Snowflake
http://github.com/innowarue/ADF
I replaced the authentic data sources with my Azure and Snowflake accounts to make the project publicly available without compromising confidentiality.
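Data Integration from OAuth2 APIs
SQL Server Replication to Mobile Devices
On-premises Data Integration for an Acquisition
Kafka Streaming and Data Integration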
Skills
Languages
SQL, Python, Bash Script, T-SQL (Transact-SQL), Snowflake, Stored Procedure, SQL DML, Scala, JavaScript, Bash
Frameworks
Hadoop, Spark, Windows PowerShell, ADF
Libraries/APIs
PySpark, REST APIs, Spark Streaming
Tools
Microsoft Power BI, Tableau, BigQuery, Synapse, SSAS, Apache Airflow, Amazon Elastic MapReduce (EMR), Git, Google Sheets
Paradigms
ETL, Business Intelligence (BI), Dimensional Modeling, Database Development, Database Design, Data Science
Platforms
Amazon Web Services (AWS), AWS Lambda, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Azure, Microsoft Power Automate, Azure Synapse, Oracle, Databricks, Apache Kafka, Salesforce, Zeppelin
Storage
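Apache Hive, MySQL, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), PSQL, Microsoft SQL Server, SQL Stored Procedures, PostgreSQL, Databases, Data Pipelines, Data Integration, Relational Databases, Database Architecture, RDBMS, Database Modeling, Dynamic SQL, NoSQL, SQL Server DBA, Database Replication, Azure SQL, MariaDB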
Other
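Azure Data Factory, Data Warehousing, Data Analysis, Data Engineering, Data, Data Architecture, Big Data, Data Migration, ELT, Data Warehouse Design, Data Transformation, Database Schema Design, ETL Tools, Scripting Languages, Data Analytics, Data Visualization, SSRS Reports, SQL Server 2015, Entity Relationships, Business Analysis, Performance Tuning, Data Modeling, Cloud, APIs, Dashboard Design, Dashboards, Web Scraping, Data Build Tool (dbt), iPaaS, CI/CD Pipelines, DAX, Data Cleansing, Azure Databricks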
Education
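Bachelor's Degree in Information Technology
University of South Africa - Pretoria, South Africa
Certifications
Databricks Certified Data Engineer Associate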
Databricks
SnowPro Core
Snowflake
Certified Apache Spark and Hadoop Developer
Cloudera
Analyzing Big Data with Hive
LinkedIn Learning
Advanced NoSQL for Data Science
LinkedIn Learning