2021-06-20 18:37:10 +02:00
|
|
|
|
# Amazon data science job comparison
|
|
|
|
|
Made as a part of Machine Learning Engineering classes at AMU Poznań.
|
|
|
|
|
|
|
|
|
|
Goal of this task is to compare 6 (at least) interesting job offers, compare requirements stated by the employers
|
|
|
|
|
and prepare a short speech about the requirement that we have found to be particularly important.
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 1 [Sr Manager, Data Science](https://www.amazon.jobs/en/jobs/1544978/sr-manager-data-science)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** AWS Data Science
|
|
|
|
|
|
|
|
|
|
**Location:** Seattle, Washington
|
|
|
|
|
|
|
|
|
|
### Required:
|
|
|
|
|
* PhD or equivalent Master's Degree plus 10+ years of experience in a quantitative field.
|
|
|
|
|
* **5+ years of people management experience**
|
|
|
|
|
* Strong analytical skills.
|
|
|
|
|
* 10+ years of experience of building predictive models for business and proficiency in model development and model
|
|
|
|
|
validation.
|
|
|
|
|
* Experience managing data pipelines for data ingestion
|
|
|
|
|
* Experience working with software development teams and taking models to production
|
|
|
|
|
* **Experience managing business stakeholders across organizations**
|
|
|
|
|
* **Strong communication skills**
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Experience with time series modeling and machine learning forecasting.
|
|
|
|
|
* Experience with supply chain methodologies
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 2 [Data Scientist, Network - Core](https://www.amazon.jobs/en/jobs/1590641/data-scientist-network-core)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** AWS Data Science
|
|
|
|
|
|
|
|
|
|
**Location:** Seattle, Washington
|
|
|
|
|
|
|
|
|
|
### Required:
|
|
|
|
|
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
|
|
|
|
|
problems
|
|
|
|
|
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
|
|
|
|
|
* Experience working in very large scale problems
|
|
|
|
|
* Experience investigating the feasibility of applying scientific concepts to business problems and products
|
|
|
|
|
* Must have at least two years of experience in the following skill(s): programming with a mathematical programming
|
|
|
|
|
language such as R, MATLAB, or SAS or major programming language such as Python, Java, C++, C#, or C
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
|
|
|
|
|
problems
|
|
|
|
|
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
|
|
|
|
|
* Experience working in very large scale problems and applying simple solutions that demonstrate deep understanding of
|
|
|
|
|
the problems
|
|
|
|
|
* Experience investigating the feasibility of applying scientific concepts to business problems and products
|
|
|
|
|
* Three years of experience in the following skill(s): R, MATLAB, or SAS or major programming language such as Python,
|
|
|
|
|
Java, C++, C#, or C
|
|
|
|
|
* Prior work experience and/or academic research in area of Time Series, Network Modelling or equivalent
|
|
|
|
|
* **Superior verbal and written communication and presentation skills, ability to convey rigorous mathematical
|
|
|
|
|
concepts and considerations to non-experts**
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 3 [Data Scientist](https://www.amazon.jobs/en/jobs/777583/data-scientist)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** AWS Cleared Jobs
|
|
|
|
|
|
|
|
|
|
**Location:** Herndon Area, VAWashington, DC | Greater Metro Area
|
|
|
|
|
|
|
|
|
|
### Required:
|
|
|
|
|
* A Bachelor or Masters Degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research,
|
|
|
|
|
Statistics, Mathematics, etc.) or equivalent experience
|
|
|
|
|
* 5+ years of industry experience in predictive modeling, data science and analysis
|
|
|
|
|
* Previous experience in a ML or data scientist type of role and a track record of building ML or DL models
|
|
|
|
|
* Active TS/SCI clearance with polygraph
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Graduate degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research, Statistics,
|
|
|
|
|
Mathematics, etc.)
|
|
|
|
|
* 10+ years of industry experience in predictive modeling
|
|
|
|
|
* Good skills with programming languages, such as Java or C/C++
|
|
|
|
|
* Ability to develop experimental and analytic plans for data modeling processes, use of strong baselines,
|
|
|
|
|
ability to accurately identify cause and effect relationships
|
|
|
|
|
* **Consulting experience and track record of helping customers with their AI needs**
|
|
|
|
|
* **Publications or presentation in recognized Machine Learning, Deep Learning and Data Mining journals/conferences**
|
|
|
|
|
* Experience using Python and/or R
|
|
|
|
|
* Knowledge of SparkML
|
|
|
|
|
* Able to write production level code, which is well-written and explainable
|
|
|
|
|
* Experience using ML libraries, such as scikit-learn, caret, mlr, mllib
|
|
|
|
|
* Experience working with GPUs to develop models
|
|
|
|
|
* Experience handling terabyte size datasets
|
|
|
|
|
* Track record of diving into data to discover hidden patterns
|
|
|
|
|
* **Familiarity with using data visualization tools**
|
|
|
|
|
* Knowledge and experience of writing and tuning SQL
|
|
|
|
|
* **Past and current experience writing and speaking about complex technical concepts to broad audiences in
|
|
|
|
|
a simplified format**
|
|
|
|
|
* **Experience giving data presentations**
|
|
|
|
|
* **Strong written and verbal communication skills**
|
|
|
|
|
* Experience with AWS technologies like Redshift, S3, EC2, Data Pipeline, & EMR
|
|
|
|
|
* **Combination of deep technical skills and business savvy enough to interface with all levels and disciplines
|
|
|
|
|
within our customer’s organization**
|
|
|
|
|
* Demonstrable track record of dealing well with ambiguity, prioritizing needs, and delivering results in a dynamic
|
|
|
|
|
environment
|
|
|
|
|
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 4 [Computer Vision Data Scientist](https://www.amazon.jobs/en/jobs/1520242/computer-vision-data-scientist)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** AWS Cleared Jobs
|
|
|
|
|
|
|
|
|
|
**Location:** US, VA
|
|
|
|
|
|
|
|
|
|
### Required:
|
|
|
|
|
* Master or PhD in computer vision/machine learning or related experience.
|
|
|
|
|
* 3+ years of relevant experience in building production-scale system/algorithm in one of the following domains:
|
|
|
|
|
computer vision, deep learning, or machine learning.
|
|
|
|
|
* Coding skills in one or more programming languages such as Python, Scala, Java, C, C+
|
|
|
|
|
* 2-3 years of modeling experience working with deep learning frameworks like Pytorch or MxNet.
|
|
|
|
|
* Current hands-on experience with state-of-the-art object detection approaches (e.g. Faster RCNN, YOLO, CenterNet etc.)
|
|
|
|
|
* Understanding of deep learning CV evaluation metrics including mAP, F_beta, PR curves, etc.
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Broad knowledge of fundamentals and state-of-the-art in computer vision/machine learning.
|
|
|
|
|
* Experience leveraging and augmenting large code base and computer vision/machine libraries/toolkits to deliver
|
|
|
|
|
new solutions.
|
|
|
|
|
* Experience extending object detection models to multi-object, multi-label tracking
|
|
|
|
|
* Experience working with geospatial datasets (e.g. satellite imagery)
|
|
|
|
|
* Experience working with motion imagery datasets (e.g. Full Motion Video/ FMV, Wide Area Motion Imagery/ WAMI)
|
|
|
|
|
* Proven track record of innovation in creating novel algorithms and advancing the state of the art
|
|
|
|
|
* Distributed training experience (DDP, Horovod)
|
|
|
|
|
* Model compilation experience (TensorRT, TVM)
|
|
|
|
|
* Familiarity deploying solutions to AWS or cloud services and experience with AWS services such as SageMaker is considered a plus
|
|
|
|
|
* Familiarity deploying solutions to IoT/edge platforms (e.g. NVIDIA Jetson Xavier)
|
|
|
|
|
* **Experience in publishing at major computer science conferences or journals**
|
|
|
|
|
* **Proven track record in technically leading and mentoring scientists**
|
|
|
|
|
* **Strong written and verbal communication skills and ability to work effectively with a large, distributed team.**
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 5 [Data Scientist - AWS Infrastructure](https://www.amazon.jobs/en/jobs/1587231/data-scientist-aws-infrastructure)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** AWS Data Science
|
|
|
|
|
|
|
|
|
|
**Location:** Arlington Area, VA
|
|
|
|
|
|
|
|
|
|
### Required:
|
|
|
|
|
* Advanced degree (M.S. or Ph.D.) in Engineering, Math, Statistics, Finance, Computer Science, or related
|
|
|
|
|
industry experience.
|
|
|
|
|
* 3+ Years of experience in data science/analysis/engineering
|
|
|
|
|
* 2+ Years of experience applying Statistics/Data Science/Machine Learning
|
|
|
|
|
* 2+ Years of Scripting experience in Python/R or other scripting languages
|
|
|
|
|
* 2+ Years of SQL experience
|
|
|
|
|
* **2+ Years of experience in Data Visualization, using Tableau, R Shiny, other off the shelf products,
|
|
|
|
|
or scripting directly**
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Experience in modeling and optimization
|
|
|
|
|
* Working knowledge of AWS tech stack.
|
|
|
|
|
* Experience with clustered data processing (e.g. Hadoop, Spark, Map-reduce, Hive)
|
|
|
|
|
* **Experience in communicating technically, at a level appropriate for the audience.**
|
|
|
|
|
|
|
|
|
|
|
2021-06-20 18:39:46 +02:00
|
|
|
|
## Job 6 [Language Engineer](https://www.amazon.jobs/en/jobs/1603588/language-engineer)
|
2021-06-20 18:37:10 +02:00
|
|
|
|
**Employer:** Amazon
|
|
|
|
|
|
|
|
|
|
**Team:** Alexa Speech
|
|
|
|
|
|
|
|
|
|
**Location:** US, CA
|
|
|
|
|
### Required:
|
|
|
|
|
* Knowledge of scripting languages (e.g. Python, bash)
|
|
|
|
|
* Knowledge of phonetics/phonology and ability to analyze/validate phonetic transcriptions
|
|
|
|
|
* Native or near-native fluency in a non-English language
|
|
|
|
|
* **Excellent written and spoken communication skills**
|
|
|
|
|
|
|
|
|
|
### Nice to have:
|
|
|
|
|
* Master’s in Computational Linguistics (or equivalent field with computational emphasis); alternatively,
|
|
|
|
|
2 years of experience in the field.
|
|
|
|
|
* Hands-on experience working with Natural Language Processing or Speech Processing
|
|
|
|
|
* Experience in writing grammars and building FSTs
|
|
|
|
|
* Strong personal interest in learning, researching, and creating new technologies related to foreign languages,
|
|
|
|
|
linguistics, phonetics, phonology and language technology
|
|
|
|
|
* Feeling comfortable and motivated when working in a fast paced, highly collaborative, dynamic work environment
|
|
|
|
|
|
|
|
|
|
## Required skills summary:
|
2021-06-20 18:58:52 +02:00
|
|
|
|
| Offer id | MSc | PhD | Communication | Visualization | AWS | Big Data | SQL | ML tools |
|
|
|
|
|
|----------|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|
|
|
|
|
|
| 1 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |
|
|
|
|
|
| 2 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
|
|
|
|
|
| 3 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: |
|
|
|
|
|
| 4 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
|
|
|
|
|
| 5 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
|
|
|
|
|
| 6 | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |
|