Avatar for Scribd
Scribd
Actively Hiring
Thousands of the best books, audiobooks, and more. all in one app
  • B2C
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • +3

Data Architect/Principal Data Engineer

Posted: 4 months ago
Visa Sponsorship

Not Available

Hires remotely
Everywhere
RelocationAllowed

About the job

At Scribd (pronounced “scribbed”), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare.

We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

Our flexible work benefit - Scribd Flex - enables employees, in partnership with their manager, to choose the daily work-style that best suits their individual needs. As an organization, we prioritize collaboration and intentional in-person moments to build culture and connection. For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.

What You'll Do:As a pivotal member of the team, you will lead the design and development of a robust data architecture that guides data modeling, integration, processing, and delivery standards enabling modern data product development at Scribd.

You will also serve as a data and analytics solution architect, leading architecture initiatives encompassing data warehousing, data pipeline development, data integrations, and data modeling. You will shape Scribd’s data strategy, guiding stakeholders in how they consume and act on data.

We’re looking for someone with proven proficiency in architecting, designing and development experience with batch and real time streaming infrastructure and workloads. Your expertise will help establish standards for data modeling, integration, processing, and delivery and also help translate business requirements into technical specifications.

At Scribd, we leverage deep data insights to inform every aspect of our business, from product development, experimentation, to understanding our subscriber engagement and tracking key performance indicators. You'll join a data engineering team tackling complex challenges within a rich domain encompassing three distinct brands – Scribd, Everand, and Slideshare – all serving a massive user base with over 200 million monthly visitors and 2 million paying subscribers. You'll have the opportunity to make a real impact as we are heavily investing in improving our core data layer and this exciting new role puts you right at the forefront of this initiative.

Based on the project, this might involve cross-functional work with the Data Science, Analytics, and other Engineering and Business teams to design cohesive data models, database schemas and data storage solutions, consumption strategies and patterns. Almost everything you will be working on will be to increase the "customer satisfaction" for internal customers of Scribd data.

Required Skills:• 7+ years of experience in data engineering, with a strong background in data architecture, data modeling, and data management, building and scaling robust data systems for complex business domains.• Expertise in Scala or Python, with a deep understanding and hands-on experience in Spark for designing, optimizing, and scaling large-scale data processing pipelines, and proficiency in at least one SQL dialect.• Experience with data lake technologies (e.g., Databricks, Delta Lake), data storage formats (Parquet, Avro), query engines (such as Photon, Spark SQL), and both real-time streaming and batch processing, or equivalent technologies and frameworks.

Desired Skills:• Experience and working knowledge of streaming platforms, typically based around Kafka.• Strong grasp of AWS data platform services and their strengths/weaknesses.• Hands on experience in implementing data pipelines for data ingestion and transformation to support analytics and ML pipelines• Strong experience communicating asynchronously using collaboration tools like Jira, Slack, etc.• Experience using automation and CI/CD tooling like Git, GitHub,Docker,Jenkins, Terraform, etc.• Experience developing standards for database design and implementation of various strategic data architecture initiatives around data quality, data management policies/standards, data governance, privacy and metadata management• Working experience integrating with BI frameworks like Qlik, ThoughtSpot, Looker, Tableau, etc.

About the company

Scribd company logo

Scribd

Actively Hiring
Thousands of the best books, audiobooks, and more. all in one app201-500 Employees
Company Size
201-500
Company Type
Software
Company Industries
Publishing
Company Industries
File Sharing
Company Industries
EBooks
Company Industries
Audiobooks
Company Industries
Magazine
  • B2C
  • Scale Stage
    Rapidly increasing operations
  • Top Investors
    This company has received a significant amount of investment from top investors
  • YC Funded
    Startup funded by Y Combinator
  • 4.5
    Highly rated
    Scribd is highly rated on Glassdoor, with 4.5 out of 5 stars
  • 4.3
    Work / Life Balance
    Employees rate Scribd 4.3/5 on Glassdoor for work / life balance
Learn more about Scribd image

Funding

AMOUNT RAISED
$105.8M
FUNDED OVER
7 rounds
Rounds
U
$58,000,000
Unknown - Nov 2019+6

Perks

Full health + dental
Scribd provides paid parental leave for the birth, adoption, or foster placement of a child.
Matching 401(k)
You’ll have the option to contribute to a matching 401(k).
Paid parental leave
Scribd provides paid parental leave for the birth, adoption, or foster placement of a child.
Gym membership
Fully paid gym membership to the fantastic new facility in our San Francisco headquarters.
Wellness benefit
$50 per month in wellness allowance for activities including yoga, fitness classes, exercise, or pilates.
Professional development
We love to send employees to conferences like WWDC, Droidcon, RubyConf, and RailsConf.

Founders

Jared Friedman
Founder • 3 years
San Francisco
image
View the team image

Similar Jobs

MightyByte company logo
MightyByte
Building awesome, scalable apps to power the future of tech
Loxo company logo
Loxo
#1 Talent Intelligence Platform & global leader in AI Recruitment Automation Software
C3.ai company logo
C3.ai
C3 AI is a leading enterprise AI software provider for accelerating digital transformation
Flow Labs company logo
Flow Labs
We’re making cleaner, clearer, safer roads for everyone — right now
Fathom company logo
Fathom
deep learning to automate medical coding
Enigma Technologies company logo
Enigma Technologies
Building world-class infrastructure and tools that transform how businesses interact
ghSMART company logo
ghSMART
ghSMART is a team of extraordinary people who become trusted advisors to the most influential leader