# Amazon data science job comparison
Made as part of the Machine Learning Engineering classes at AMU Poznań.
The goal of this task is to compare at least 6 interesting job offers, compare the requirements stated by the employers,
and prepare a short speech about the requirement that we found to be particularly important.
## Job 1 [Sr Manager, Data Science](https://www.amazon.jobs/en/jobs/1544978/sr-manager-data-science)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Seattle, Washington
### Required:
* PhD or equivalent Master's Degree plus 10+ years of experience in a quantitative field.
* **5+ years of people management experience**
* Strong analytical skills.
* 10+ years of experience building predictive models for business and proficiency in model development and model
validation.
* Experience managing data pipelines for data ingestion
* Experience working with software development teams and taking models to production
* **Experience managing business stakeholders across organizations**
* **Strong communication skills**
### Nice to have:
* Experience with time series modeling and machine learning forecasting.
* Experience with supply chain methodologies
## Job 2 [Data Scientist, Network - Core](https://www.amazon.jobs/en/jobs/1590641/data-scientist-network-core)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Seattle, Washington
### Required:
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
problems
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative field
* Experience working in very large scale problems
* Experience investigating the feasibility of applying scientific concepts to business problems and products
* Must have at least two years of experience in the following skill(s): programming with a mathematical programming
language such as R, MATLAB, or SAS or major programming language such as Python, Java, C++, C#, or C
### Nice to have:
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
problems
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative field
* Experience working in very large scale problems and applying simple solutions that demonstrate deep understanding of
the problems
* Experience investigating the feasibility of applying scientific concepts to business problems and products
* Three years of experience in the following skill(s): R, MATLAB, or SAS or major programming language such as Python,
Java, C++, C#, or C
* Prior work experience and/or academic research in area of Time Series, Network Modelling or equivalent
* **Superior verbal and written communication and presentation skills, ability to convey rigorous mathematical
concepts and considerations to non-experts**
## Job 3 [Data Scientist](https://www.amazon.jobs/en/jobs/777583/data-scientist)
**Employer:** Amazon
**Team:** AWS Cleared Jobs
**Location:** Herndon Area, VA / Washington, DC | Greater Metro Area
### Required:
* A Bachelor's or Master's degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research,
Statistics, Mathematics, etc.) or equivalent experience
* 5+ years of industry experience in predictive modeling, data science and analysis
* Previous experience in a ML or data scientist type of role and a track record of building ML or DL models
* Active TS/SCI clearance with polygraph
### Nice to have:
* Graduate degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research, Statistics,
Mathematics, etc.)
* 10+ years of industry experience in predictive modeling
* Good skills with programming languages, such as Java or C/C++
* Ability to develop experimental and analytic plans for data modeling processes, use of strong baselines,
ability to accurately identify cause and effect relationships
* **Consulting experience and track record of helping customers with their AI needs**
* **Publications or presentation in recognized Machine Learning, Deep Learning and Data Mining journals/conferences**
* Experience using Python and/or R
* Knowledge of SparkML
* Able to write production level code, which is well-written and explainable
* Experience using ML libraries, such as scikit-learn, caret, mlr, mllib
* Experience working with GPUs to develop models
* Experience handling terabyte size datasets
* Track record of diving into data to discover hidden patterns
* **Familiarity with using data visualization tools**
* Knowledge and experience of writing and tuning SQL
* **Past and current experience writing and speaking about complex technical concepts to broad audiences in
a simplified format**
* **Experience giving data presentations**
* **Strong written and verbal communication skills**
* Experience with AWS technologies like Redshift, S3, EC2, Data Pipeline, & EMR
* **Combination of deep technical skills and business savvy enough to interface with all levels and disciplines
within our customer's organization**
* Demonstrable track record of dealing well with ambiguity, prioritizing needs, and delivering results in a dynamic
environment
## Job 4 [Computer Vision Data Scientist](https://www.amazon.jobs/en/jobs/1520242/computer-vision-data-scientist)
**Employer:** Amazon
**Team:** AWS Cleared Jobs
**Location:** US, VA
### Required:
* Master's or PhD in computer vision/machine learning or related experience.
* 3+ years of relevant experience in building production-scale system/algorithm in one of the following domains:
computer vision, deep learning, or machine learning.
* Coding skills in one or more programming languages such as Python, Scala, Java, C, C++
* 2-3 years of modeling experience working with deep learning frameworks like PyTorch or MXNet.
* Current hands-on experience with state-of-the-art object detection approaches (e.g. Faster R-CNN, YOLO, CenterNet, etc.)
* Understanding of deep learning CV evaluation metrics including mAP, F_beta, PR curves, etc.
### Nice to have:
* Broad knowledge of fundamentals and state-of-the-art in computer vision/machine learning.
* Experience leveraging and augmenting large code bases and computer vision/machine learning libraries/toolkits to deliver
new solutions.
* Experience extending object detection models to multi-object, multi-label tracking
* Experience working with geospatial datasets (e.g. satellite imagery)
* Experience working with motion imagery datasets (e.g. Full Motion Video/ FMV, Wide Area Motion Imagery/ WAMI)
* Proven track record of innovation in creating novel algorithms and advancing the state of the art
* Distributed training experience (DDP, Horovod)
* Model compilation experience (TensorRT, TVM)
* Familiarity with deploying solutions to AWS or cloud services; experience with AWS services such as SageMaker is considered a plus
* Familiarity with deploying solutions to IoT/edge platforms (e.g. NVIDIA Jetson Xavier)
* **Experience in publishing at major computer science conferences or journals**
* **Proven track record in technically leading and mentoring scientists**
* **Strong written and verbal communication skills and ability to work effectively with a large, distributed team.**
## Job 5 [Data Scientist - AWS Infrastructure](https://www.amazon.jobs/en/jobs/1587231/data-scientist-aws-infrastructure)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Arlington Area, VA
### Required:
* Advanced degree (M.S. or Ph.D.) in Engineering, Math, Statistics, Finance, Computer Science, or related
industry experience.
* 3+ Years of experience in data science/analysis/engineering
* 2+ Years of experience applying Statistics/Data Science/Machine Learning
* 2+ Years of Scripting experience in Python/R or other scripting languages
* 2+ Years of SQL experience
* **2+ Years of experience in Data Visualization, using Tableau, R Shiny, other off the shelf products,
or scripting directly**
### Nice to have:
* Experience in modeling and optimization
* Working knowledge of AWS tech stack.
* Experience with clustered data processing (e.g. Hadoop, Spark, Map-reduce, Hive)
* **Experience in communicating technically, at a level appropriate for the audience.**
## Job 6 [Language Engineer](https://www.amazon.jobs/en/jobs/1603588/language-engineer)
**Employer:** Amazon
**Team:** Alexa Speech
**Location:** US, CA
### Required:
* Knowledge of scripting languages (e.g. Python, bash)
* Knowledge of phonetics/phonology and ability to analyze/validate phonetic transcriptions
* Native or near-native fluency in a non-English language
* **Excellent written and spoken communication skills**
### Nice to have:
* Master's in Computational Linguistics (or equivalent field with computational emphasis); alternatively,
2 years of experience in the field.
* Hands-on experience working with Natural Language Processing or Speech Processing
* Experience in writing grammars and building FSTs
* Strong personal interest in learning, researching, and creating new technologies related to foreign languages,
linguistics, phonetics, phonology and language technology
* Feeling comfortable and motivated when working in a fast paced, highly collaborative, dynamic work environment
## Required skills summary:
| Offer id | MSc | PhD | Communication | Visualization | AWS | Big Data | SQL | ML tools |
|----------|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|
| 1 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |
| 2 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| 3 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: |
| 4 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
| 5 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
| 6 | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |
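
The summary table can also be tallied per skill. Below is a minimal Python sketch, assuming the table is transcribed by hand into a dictionary. The names `SKILLS`, `TABLE` and `skill_counts` are made up for illustration and are not part of the original project.

```python
# Column names copied from the summary table header.
SKILLS = ["MSc", "PhD", "Communication", "Visualization",
          "AWS", "Big Data", "SQL", "ML tools"]

# Rows transcribed by hand from the table (assumption: 1 = check mark, 0 = cross).
# Keys are offer ids, values follow the column order of SKILLS.
TABLE = {
    1: [1, 1, 1, 0, 0, 0, 0, 0],
    2: [1, 0, 1, 1, 0, 1, 1, 1],
    3: [1, 1, 1, 0, 1, 0, 0, 1],
    4: [1, 1, 1, 1, 1, 1, 1, 0],
    5: [1, 1, 1, 1, 1, 1, 1, 0],
    6: [1, 0, 1, 0, 0, 0, 0, 0],
}


def skill_counts(table):
    """Return {skill: number of offers marked with a check mark}."""
    return {
        skill: sum(row[i] for row in table.values())
        for i, skill in enumerate(SKILLS)
    }


counts = skill_counts(TABLE)

# Print skills from the most to the least frequently marked one.
for skill, n in sorted(counts.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{skill}: {n}/{len(TABLE)}")
```

In the table as transcribed, MSc and Communication are marked for all 6 offers, while ML tools is marked for only 2.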