180 lines
9.2 KiB
Markdown
180 lines
9.2 KiB
Markdown
# Amazon data science job comparison
|
||
Made as a part of Machine Learning Engineering classes at AMU Poznań.
|
||
|
||
Goal of this task is to compare 6 (at least) interesting job offers, compare requirements stated by the employers
|
||
and prepare a short speech about the requirement that we have found to be particularly important.
|
||
|
||
## Job 1 [Sr Manager, Data Science]{https://www.amazon.jobs/en/jobs/1544978/sr-manager-data-science}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** AWS Data Science
|
||
|
||
**Location:** Seattle, Washington
|
||
|
||
### Required:
|
||
* PhD or equivalent Master's Degree plus 10+ years of experience in a quantitative field.
|
||
* **5+ years of people management experience**
|
||
* Strong analytical skills.
|
||
* 10+ years of experience of building predictive models for business and proficiency in model development and model
|
||
validation.
|
||
* Experience managing data pipelines for data ingestion
|
||
* Experience working with software development teams and taking models to production
|
||
* **Experience managing business stakeholders across organizations**
|
||
* **Strong communication skills**
|
||
|
||
### Nice to have:
|
||
* Experience with time series modeling and machine learning forecasting.
|
||
* Experience with supply chain methodologies
|
||
|
||
## Job 2 [Data Scientist, Network - Core]{https://www.amazon.jobs/en/jobs/1590641/data-scientist-network-core}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** AWS Data Science
|
||
|
||
**Location:** Seattle, Washington
|
||
|
||
### Required:
|
||
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
|
||
problems
|
||
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
|
||
* Experience working in very large scale problems
|
||
* Experience investigating the feasibility of applying scientific concepts to business problems and products
|
||
* Must have at least two years of experience in the following skill(s): programming with a mathematical programming
|
||
language such as R, MATLAB, or SAS or major programming language such as Python, Java, C++, C#, or C
|
||
|
||
### Nice to have:
|
||
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
|
||
problems
|
||
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
|
||
* Experience working in very large scale problems and applying simple solutions that demonstrate deep understanding of
|
||
the problems
|
||
* Experience investigating the feasibility of applying scientific concepts to business problems and products
|
||
* Three years of experience in the following skill(s): R, MATLAB, or SAS or major programming language such as Python,
|
||
Java, C++, C#, or C
|
||
* Prior work experience and/or academic research in area of Time Series, Network Modelling or equivalent
|
||
* **Superior verbal and written communication and presentation skills, ability to convey rigorous mathematical
|
||
concepts and considerations to non-experts**
|
||
|
||
## Job 3 [Data Scientist]{https://www.amazon.jobs/en/jobs/777583/data-scientist}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** AWS Cleared Jobs
|
||
|
||
**Location:** Herndon Area, VAWashington, DC | Greater Metro Area
|
||
|
||
### Required:
|
||
* A Bachelor or Masters Degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research,
|
||
Statistics, Mathematics, etc.) or equivalent experience
|
||
* 5+ years of industry experience in predictive modeling, data science and analysis
|
||
* Previous experience in a ML or data scientist type of role and a track record of building ML or DL models
|
||
* Active TS/SCI clearance with polygraph
|
||
|
||
### Nice to have:
|
||
* Graduate degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research, Statistics,
|
||
Mathematics, etc.)
|
||
* 10+ years of industry experience in predictive modeling
|
||
* Good skills with programming languages, such as Java or C/C++
|
||
* Ability to develop experimental and analytic plans for data modeling processes, use of strong baselines,
|
||
ability to accurately identify cause and effect relationships
|
||
* **Consulting experience and track record of helping customers with their AI needs**
|
||
* **Publications or presentation in recognized Machine Learning, Deep Learning and Data Mining journals/conferences**
|
||
* Experience using Python and/or R
|
||
* Knowledge of SparkML
|
||
* Able to write production level code, which is well-written and explainable
|
||
* Experience using ML libraries, such as scikit-learn, caret, mlr, mllib
|
||
* Experience working with GPUs to develop models
|
||
* Experience handling terabyte size datasets
|
||
* Track record of diving into data to discover hidden patterns
|
||
* **Familiarity with using data visualization tools**
|
||
* Knowledge and experience of writing and tuning SQL
|
||
* **Past and current experience writing and speaking about complex technical concepts to broad audiences in
|
||
a simplified format**
|
||
* **Experience giving data presentations**
|
||
* **Strong written and verbal communication skills**
|
||
* Experience with AWS technologies like Redshift, S3, EC2, Data Pipeline, & EMR
|
||
* **Combination of deep technical skills and business savvy enough to interface with all levels and disciplines
|
||
within our customer’s organization**
|
||
* Demonstrable track record of dealing well with ambiguity, prioritizing needs, and delivering results in a dynamic
|
||
environment
|
||
|
||
|
||
## Job 4 [Computer Vision Data Scientist]{https://www.amazon.jobs/en/jobs/1520242/computer-vision-data-scientist}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** AWS Cleared Jobs
|
||
|
||
**Location:** US, VA
|
||
|
||
### Required:
|
||
* Master or PhD in computer vision/machine learning or related experience.
|
||
* 3+ years of relevant experience in building production-scale system/algorithm in one of the following domains:
|
||
computer vision, deep learning, or machine learning.
|
||
* Coding skills in one or more programming languages such as Python, Scala, Java, C, C+
|
||
* 2-3 years of modeling experience working with deep learning frameworks like Pytorch or MxNet.
|
||
* Current hands-on experience with state-of-the-art object detection approaches (e.g. Faster RCNN, YOLO, CenterNet etc.)
|
||
* Understanding of deep learning CV evaluation metrics including mAP, F_beta, PR curves, etc.
|
||
|
||
### Nice to have:
|
||
* Broad knowledge of fundamentals and state-of-the-art in computer vision/machine learning.
|
||
* Experience leveraging and augmenting large code base and computer vision/machine libraries/toolkits to deliver
|
||
new solutions.
|
||
* Experience extending object detection models to multi-object, multi-label tracking
|
||
* Experience working with geospatial datasets (e.g. satellite imagery)
|
||
* Experience working with motion imagery datasets (e.g. Full Motion Video/ FMV, Wide Area Motion Imagery/ WAMI)
|
||
* Proven track record of innovation in creating novel algorithms and advancing the state of the art
|
||
* Distributed training experience (DDP, Horovod)
|
||
* Model compilation experience (TensorRT, TVM)
|
||
* Familiarity deploying solutions to AWS or cloud services and experience with AWS services such as SageMaker is considered a plus
|
||
* Familiarity deploying solutions to IoT/edge platforms (e.g. NVIDIA Jetson Xavier)
|
||
* **Experience in publishing at major computer science conferences or journals**
|
||
* **Proven track record in technically leading and mentoring scientists**
|
||
* **Strong written and verbal communication skills and ability to work effectively with a large, distributed team.**
|
||
|
||
## Job 5 [Data Scientist - AWS Infrastructure]{https://www.amazon.jobs/en/jobs/1587231/data-scientist-aws-infrastructure}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** AWS Data Science
|
||
|
||
**Location:** Arlington Area, VA
|
||
|
||
### Required:
|
||
* Advanced degree (M.S. or Ph.D.) in Engineering, Math, Statistics, Finance, Computer Science, or related
|
||
industry experience.
|
||
* 3+ Years of experience in data science/analysis/engineering
|
||
* 2+ Years of experience applying Statistics/Data Science/Machine Learning
|
||
* 2+ Years of Scripting experience in Python/R or other scripting languages
|
||
* 2+ Years of SQL experience
|
||
* **2+ Years of experience in Data Visualization, using Tableau, R Shiny, other off the shelf products,
|
||
or scripting directly**
|
||
|
||
### Nice to have:
|
||
* Experience in modeling and optimization
|
||
* Working knowledge of AWS tech stack.
|
||
* Experience with clustered data processing (e.g. Hadoop, Spark, Map-reduce, Hive)
|
||
* **Experience in communicating technically, at a level appropriate for the audience.**
|
||
|
||
|
||
## Job 6 [Language Engineer]{https://www.amazon.jobs/en/jobs/1603588/language-engineer}
|
||
**Employer:** Amazon
|
||
|
||
**Team:** Alexa Speech
|
||
|
||
**Location:** US, CA
|
||
### Required:
|
||
* Knowledge of scripting languages (e.g. Python, bash)
|
||
* Knowledge of phonetics/phonology and ability to analyze/validate phonetic transcriptions
|
||
* Native or near-native fluency in a non-English language
|
||
* **Excellent written and spoken communication skills**
|
||
|
||
### Nice to have:
|
||
* Master’s in Computational Linguistics (or equivalent field with computational emphasis); alternatively,
|
||
2 years of experience in the field.
|
||
* Hands-on experience working with Natural Language Processing or Speech Processing
|
||
* Experience in writing grammars and building FSTs
|
||
* Strong personal interest in learning, researching, and creating new technologies related to foreign languages,
|
||
linguistics, phonetics, phonology and language technology
|
||
* Feeling comfortable and motivated when working in a fast paced, highly collaborative, dynamic work environment
|
||
|
||
## Required skills summary:
|
||
| Offer id | skill 1 | skill 2 |
|
||
| 1 | :x: |:heavy_check_mark: | |