Cargill Finance Global Data Lake Specialist in Bangalore, India
Finance Global Data Lake Specialist
Cargill provides food, agriculture, financial and industrial products and services to the world. Together with farmers, customers, governments and communities, we help people thrive by applying our insights and over 150 years of experience. We have 150,000 employees in 70 countries who are committed to feeding the world in a responsible way, reducing environmental impact and improving the communities where we live and work.
The Finance organization within Cargill includes FP&A, A&F, Corporate Financial Reporting, Treasury, Tax and the related Finance activities performed in the Cargill business Enterprises. In addition it includes the various Finance activities executed by the 6 Cargill Business Service centers, globally. The group collectively is working to establish a concerted focus on critical data management areas to help achieve a vision of more widely accessible, usable and secured financial related data. This position will be part of the ground breaking activities towards achieving these goals.
The Finance Data Lake Specialist is expected to work closely with cross Finance stakeholders to ensure the ecosystem built inside the Cargill Data Platform to house financial data lakes is in place to support the ability to acquire, transform and analyze Data. This model should ensure Finance specific data lakes and the corporate data warehouse work in unison, each playing its role. As the data lake grows, both in terms of usage and volume of data housed, the data lake expert is expected to setup the optimum governance, compliance, security and auditing. Regular interaction with an agreed upon member of the Finance Data Council will also be required to help champion the cause of expanding data lake usage and having an agreed governance model.
1.Establish and seamlessly govern the Finance data lake setup and capability (70%)
a.Work with a select group (comprising of Finance and IT stakeholders) to review existing data lake usage, understand the emerging future data lake needs and define a process to formalize and implement the operating model for finance data lake development and usage.
b.Include IT and finance leaders on the team to give a balanced view of data use, governance and data lake needs.
c.Implement repeatable operating procedures for ongoing data ingestion along with key metadata and metrics to be captured during the process.
d.Ensure appropriate metadata is created, maintained and accessible to expedite data analysis. This includes coordinating creation of data dictionaries, data profiling and data quality report generation.
e.Coordinate data format/data dictionary changes with business data stewards and external data vendors when applicable.
f.Keep the section of the lake used for data exploration organized and perform housekeeping on behalf of the data scientists.
g.Enforce Finance defined naming standards, access controls and data cleanup policies in various zones of the data lake. Reduce accumulation of redundant, obsolete and duplicative data in data lake.
h.Provide metrics on data utilization and data quality in the data lake.
i.Involve at least one Data Council member to champion the cause of data lake usage and supporting an agreed governance model.
j.Partner with development teams for the creation of new data marts/datasets in the Data Lake for use by downstream applications.
k.Participate in design discussions and contribute to the architecture process.
2.Data Security and Compliance in the data lake environment (20%)
a.Make sure effective data classifications are in place in the data lake environment so data requiring protection for legal or compliance reasons meets those requirements in an auditable way.
b.Set levels of data access in the data lake by leveraging the existing Cargill network for system access, single sign-on through Active Directory or an agreed upon other solution. The preference is to not have separate data access protocols.
c.Implement tools that allow for different levels of access based on user privileges; this protects sensitive data while allowing access to higher-level analytics.
d.Include mobility and BYOD (bring your own device) in the plans as these are often large sources of potential data leakage.
e.Partner with the Cargill Data Platform team, Finance Data Security Officer and Finance business users/data owners.
3.Promote the value of leveraging the use of a controlled and appropriately governed data lake environment for the use of data by multiple teams whereby data comes from a common source and large volumes of data can be utilized for analysis, reporting, etc., in a cost effective manner. This will further promote the data management focused culture/mindset and monitor & communicate conformance with Finance data policies (5%)
4.Miscellaneous duties and responsibilities as assigned (5%)
Minimum Required Qualifications
B.E. and/or MBA from institute of repute with atleast 10 years of work experience
Minimum of 5-7 years hands-on experience with Information Management and Big Data technologies e.g. Hadoop, Spark, Hive. Robust experience with Cloudera is a plus.
Excellent verbal and written English communication skills and demonstrated experiences communicating across diverse cultures, teams and regions
Ability, as exhibited from past experiences, to build and explain a robust data lake management operating model and focused culture
Experience in Data Management and Analytics in multinational corporations
Experience in SAP
Formal degree or qualification in Data Management or Data Lake (or Data Platform) Modeling
Formal certification in Self-service
Primary Location India-KA-Bangalore
Job Type Standard
Shift Day Job
Req ID: BAN01996