Costco
Site Reliability Engineer - DAEO
Plano, TX
Dec 19, 2024
Full-time
Full Job Description

Costco IT is responsible for the technical future of Costco Wholesale, the third largest retailer in the world with wholesale operations in fourteen countries. Despite our size and explosive international expansion, we continue to provide a family, employee centric atmosphere in which our employees thrive and succeed.

This is an environment unlike anything in the high-tech world and the secret of Costco’s success is its culture. The value Costco puts on its employees is well documented in articles from a variety of publishers including Bloomberg and Forbes. Our employees and our members come FIRST. Costco is well known for its generosity and community service and has won many awards for its philanthropy. The company joins with its employees to take an active role in volunteering by sponsoring many opportunities to help others.

Come join the Costco Wholesale IT family. Costco IT is a dynamic, fast-paced environment, working through exciting transformation efforts. We are building the next generation retail environment where you will be surrounded by dedicated and highly professional employees.

Data Engineers are responsible for developing and operationalizing data pipelines/integrations to make data available for consumption (i.e. Reporting, Data Science/Machine Learning, Data APIs, etc.). This includes data ingestion, data transformation, data validation/quality, data pipeline optimization, orchestration; and deploying code to production via CI/CD. The Data Engineer role requires knowledge of software development/programming methodologies, various data sources (Relational Databases, flat files (csv, delimited), APIs, XML, JSON, etc.), data access (SQL, Python, etc.), followed by expertise in data modeling, cloud architectures/platforms, data warehousing, and data lakes. This role also will partner closely with Product Owners, Data Architects, Platform/DevOps Engineers, etc. to design, build, test, implement, and maintain data pipelines.

Engineers have deep knowledge and hands-on experience in enterprise-wide platforms, and solve technical problems while working on technology initiatives. Engineers will be required to be available for 24x7 support as needed and be on call on a rotational basis. Engineers have strong architectural, leadership, and technical skills. Engineers should have high-level skills in PowerBI, Databricks, Azure, Alteryx, UC4, and Webi. System administration, monitoring experience and observability is a plus. Engineers interact in a highly effective manner with other team members and management, drive innovation, and influence delivery and performance.

The Site Reliability Engineer (SRE) will be responsible for maintaining and improving the availability, performance, and capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support. The SRE will translate Costco’s goals and strategies for system availability, performance, and capacity into designs and plans for technical solutions. The SRE will work with other Costco teams to resolve issues affecting the availability, performance, or capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support.

The SRE will also work with other Costco teams to identify upcoming events that could affect demand on system performance and prepare mitigation plans. The SRE will work with teams and System Architects to implement, maintain, and validate disaster recovery plans and other solutions to avoid or mitigate service interruptions.

The SRE will monitor the availability, performance, and capacity of Data Analytics Engineering Operations Problem and Incident Resolution Support to identify trends and concerns. Create and disseminate system reliability reports to Costco management in support of planning and decision making. The SRE will also assist in the development of policies, standards, and guidelines for the maintenance and operation of Costco’s overall Data Analytics Engineering Operations Problem and Incident Resolution Support Solutions.

Additionally, this role will work closely with other members of DAEO, Operations teams, the Quality Assurance team, Software Development teams, Support teams, and management to achieve team goals.

If you want to be a part of one of the worldwide BEST companies “to work for”, simply apply and let your career be reimagined.

ROLE

● Develops complex SQL & Python against a variety of data sources.

● Implements streaming data pipelines using event/message-based architectures.

● Demonstrates ability to communicate technical concepts to non-technical audiences both in written and verbal form.

● Defines and maintains optimal data pipeline architecture.

● Analyzes data to spot anomalies, trends and correlate data to ensure Data Quality.

● Develops data pipelines to store data in defined data models/structures.

● Demonstrates strong understanding of data integration techniques and tools (e.g. Extract, Transform, Load (ETL) / Extract, Load, Transform (ELT)) tools.

● Demonstrates strong understanding of database storage concepts (Data Lake, Relational Databases, NoSQL, Graph, data warehousing).

● Performs peer review for another Data Engineer’s work.

● Partners with Project Managers, Solution Leads, and other stakeholders to help maintain a robust framework to support applications and quality solutions.

● Contributes, interprets, and communicates enterprise, technical, project, and operational strategies to the team.

● Works with teams, management, and stakeholders to conceptualize, design, build, test, and release products.

● Shares relevant information among teams.

● Influences and drives adoption of best practices and high-quality standards throughout the division.

● Integrates diverse solution components across multiple platforms using industry standard interfaces.

● Tests and resolves problems, performs root cause analysis, identifies gaps, recommends solutions and preventative measures, and leads team members to solution delivery plans.

● Runs proof of concepts and uses diagnostic/debugging skills to solve current challenges in multi-platform systems.

● Orchestrates reviews for system additions and/or enhancements.

● Promotes and supports a culture of compliance, risk avoidance/mitigation, and corporate accountability throughout the organization through technical leadership, knowledge of business need, development and communication of policies, procedures, plans, and assurance of solution designs that are in compliance with architecture standards, technology guardrails, security, and operational guidelines.

● Provides leadership/mentoring to team members; implements development efficiencies; creates appropriate documentation; drives operational efficiencies and technical growth within the team, and supports the release model.

● Optimizes team efficiency and performance through high-level technical direction.

● Provides technical leadership in implementation of applications, strategic planning sessions, documentation of requirements, tool implementation, database query languages, and programming languages.

● Assists in management and operation of site reliability functions related to Data Analytics Engineering Operations Problem & Incident Resolution Support.

● Develops, establishes, and enforces policies, standards, and guidelines for site reliability.

● Identifies, designs, develops, and deploys tools and processes to monitor, maintain, and report site performance and availability.

● Tracks system performance, capacity, and uses experience to create effective strategies for maintaining and improving system performance and availability. Advises the International Ecommerce team on said strategies.

● Uses communication and documentation to communicate and coordinate with other team members.

● Applies technical expertise to lead the resolution of system issues related to system performance, availability, and capacity.

● Contributes to and maintains an in-depth understanding of the Non-Production and Production architectures, including the hosted server environments, deployed applications, integrated packages, and third-party operational tools, particularly as they relate to system performance, availability, and capacity.

● Works with the design, development, QA, Technical Operations teams, as well as other stakeholders, to maintain the performance, availability, and integrity of Data Analytics Engineering Operations Problem & Incident Resolution Support.

● Communicates effectively with project teams and other participants in ongoing system development to identify and resolve issues, explain solutions, and provide technical expertise.

REQUIRED

● Experience in the development, maintenance, and operation of highly-available systems.

● Experience in the ability to communicate clearly with a range of stakeholders, including clients, employees, and contractors. They need to be able to explain technical information and negotiate well.

● Knowledge and deep understanding of Costco’s systems and operating environments to detect and diagnose issues before they cause outages or performance degradation.

● Expertise in the use of monitoring and reporting tools for performance and analysis.

● Ability to work effectively with other teams, including vendors, customers, users, managers, and peers.

● Essential skills needed to be a data analyst including Sql queries, Excel/Google Sheets, Critical Thinking, Data Visualization and Presentation experience.

● Understand how to use the power of data to drive decision-making and solve complex problems.

● Hands-on experience operating and maintaining high-availability systems. Past experience tuning and maintaining the performance of systems is desirable.

● Ability to prototype and demonstrate mechanisms for performance improvement, high availability, and system scaling.

● Proficient in the use of the Google products: email, spreadsheet, document, presentation, analytics.

● Excellent interpersonal and diplomatic skills, as well as a positive attitude.

● Strong communication skills, both oral and written, including presentation skills.

● Demonstrated team-building skills with ability to influence and motivate team members that report to other groups in the organization.

● Adept at assessing issues with ability to devise workable solutions quickly.

● Able to work independently.

● Excellent organizational and planning skills, with experience building tactical plans.

● Flexible; must be able to change priorities quickly, focus on new ones without distraction; able to deal with conflict and work under pressure to meet deliverable dates/timelines.

● Experience in negotiating timelines and deliverables with a strong sense of urgency.

● Scheduling flexibility to meet the needs of the business in a 24x7x365 operations environment - evening, weekend, and holiday work.

Recommended

● 2-5 years of operational support and strategies knowledge preferred.

● Familiarity with Azure technology tools (ADO Boards, Repos, Pipelines, DevOps, Databases, etc)

● Familiarity with Service-Now Application, including familiarity with Incident Handling and Problem Resolution.

● Familiarity with principles of Continuous Integration and deployment practices (CI/CD).

● Ability to research and introduce new technologies, practices, and techniques; open to continued learning.

● Familiarity with Program and Project Management practices.

● Experience working on medium and large-scale projects, under both traditional and agile development methodologies.

● Strong knowledge of Systems Development Life Cycle processes, as well as demonstrated experience with Agile and DevOps methodologies.

● Proficient in Google Workspace applications, including Sheets, Docs, Slides, and Gmail.

Required Documents

● Cover Letter

● Resume

California applicants, please click here to review the Costco Applicant Privacy Notice.

Pay Ranges:

Level 1 - $85,000 - $110,000

Level 2 - $105,000 - $135,000

Level 3 - $130,000 - $160,000

Level SR - $150,000 - $190,000, Bonus and Restricted Stock Unit (RSU) eligible

We offer a comprehensive package of benefits including paid time off, health benefits - medical/dental/vision/hearing aid/pharmacy/behavioral health/employee assistance, health care reimbursement account, dependent care assistance plan, short-term disability and long-term disability insurance, AD&D insurance, life insurance, 401(k), stock purchase plan to eligible employees.

Costco is committed to a diverse and inclusive workplace. Costco is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or any other legally protected status. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to IT-Recruiting@costco.com

If hired, you will be required to provide proof of authorization to work in the United States. In some cases, applicants and employees for selected positions will not be sponsored for work authorization, including, but not limited to H1-B visas.

PDN-9dc24d37-c417-4b63-bf6c-87f8aa2040f9
Job Information
Job Category:
Engineering
Spotlight Employer
Related jobs
Office Assistant
Education Minnesota
POSITION TITLE:                          Office Assistant (Job #2024-21) DEPARTM...
Dec 19, 2024
Saint Paul, MN
Director of Facilities and Grounds
CENTRAL VALLEY SCHOOL DISTRICT
Central Valley School District Director of Facilities and Grounds Position responsible for day to day operations of all district facilities to include: 4 buildings, athletic facilities, maintenance ga...
Dec 19, 2024
Monaca, PA
Director of Facilities and Grounds
Central Valley School District
DIRECTOR OF FACILITIES AND GROUNDSPosition responsible for day-to-day operations of all district facilities to include: 4 buildings, athletic facilities, maintenance garage, and district-owned vehicle...
Dec 19, 2024
Monaca, PA
©2024 TalentAlly.
Powered by TalentAlly.
Apply for this job
Site Reliability Engineer - DAEO
Costco
Plano, TX
Dec 19, 2024
Full-time
Your Information
First Name *
Last Name *
Email Address *
Zip Code *
Password *
Confirm Password *
Create your Profile from your Resume
By clicking the Apply button, you agree to the terms of use and privacy policy.
Continue to Apply

Costco would like you to finish the application on their website.

Ace your interview with AI-powered interview practice

Get comfortable talking to hiring managers, receive personalized feedback on areas for improvement, sharpen your ability to answer the most common questions, and build confidence in formulating strong responses on the spot. Click the button below to begin your three free virtual interviews!