ium-praca/readme.md
2021-06-20 18:58:52 +02:00

187 lines
10 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# Amazon data science job comparison
Made as a part of Machine Learning Engineering classes at AMU Poznań.
Goal of this task is to compare 6 (at least) interesting job offers, compare requirements stated by the employers
and prepare a short speech about the requirement that we have found to be particularly important.
## Job 1 [Sr Manager, Data Science](https://www.amazon.jobs/en/jobs/1544978/sr-manager-data-science)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Seattle, Washington
### Required:
* PhD or equivalent Master's Degree plus 10+ years of experience in a quantitative field.
* **5+ years of people management experience**
* Strong analytical skills.
* 10+ years of experience of building predictive models for business and proficiency in model development and model
validation.
* Experience managing data pipelines for data ingestion
* Experience working with software development teams and taking models to production
* **Experience managing business stakeholders across organizations**
* **Strong communication skills**
### Nice to have:
* Experience with time series modeling and machine learning forecasting.
* Experience with supply chain methodologies
## Job 2 [Data Scientist, Network - Core](https://www.amazon.jobs/en/jobs/1590641/data-scientist-network-core)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Seattle, Washington
### Required:
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
problems
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
* Experience working in very large scale problems
* Experience investigating the feasibility of applying scientific concepts to business problems and products
* Must have at least two years of experience in the following skill(s): programming with a mathematical programming
language such as R, MATLAB, or SAS or major programming language such as Python, Java, C++, C#, or C
### Nice to have:
* Experience formulating and solving predictive modeling, machine learning, forecasting or statistical modeling
problems
* PhD or equivalent Master's degree plus 3+ years of research experience in a quantitative filed
* Experience working in very large scale problems and applying simple solutions that demonstrate deep understanding of
the problems
* Experience investigating the feasibility of applying scientific concepts to business problems and products
* Three years of experience in the following skill(s): R, MATLAB, or SAS or major programming language such as Python,
Java, C++, C#, or C
* Prior work experience and/or academic research in area of Time Series, Network Modelling or equivalent
* **Superior verbal and written communication and presentation skills, ability to convey rigorous mathematical
concepts and considerations to non-experts**
## Job 3 [Data Scientist](https://www.amazon.jobs/en/jobs/777583/data-scientist)
**Employer:** Amazon
**Team:** AWS Cleared Jobs
**Location:** Herndon Area, VAWashington, DC | Greater Metro Area
### Required:
* A Bachelor or Masters Degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research,
Statistics, Mathematics, etc.) or equivalent experience
* 5+ years of industry experience in predictive modeling, data science and analysis
* Previous experience in a ML or data scientist type of role and a track record of building ML or DL models
* Active TS/SCI clearance with polygraph
### Nice to have:
* Graduate degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research, Statistics,
Mathematics, etc.)
* 10+ years of industry experience in predictive modeling
* Good skills with programming languages, such as Java or C/C++
* Ability to develop experimental and analytic plans for data modeling processes, use of strong baselines,
ability to accurately identify cause and effect relationships
* **Consulting experience and track record of helping customers with their AI needs**
* **Publications or presentation in recognized Machine Learning, Deep Learning and Data Mining journals/conferences**
* Experience using Python and/or R
* Knowledge of SparkML
* Able to write production level code, which is well-written and explainable
* Experience using ML libraries, such as scikit-learn, caret, mlr, mllib
* Experience working with GPUs to develop models
* Experience handling terabyte size datasets
* Track record of diving into data to discover hidden patterns
* **Familiarity with using data visualization tools**
* Knowledge and experience of writing and tuning SQL
* **Past and current experience writing and speaking about complex technical concepts to broad audiences in
a simplified format**
* **Experience giving data presentations**
* **Strong written and verbal communication skills**
* Experience with AWS technologies like Redshift, S3, EC2, Data Pipeline, & EMR
* **Combination of deep technical skills and business savvy enough to interface with all levels and disciplines
within our customers organization**
* Demonstrable track record of dealing well with ambiguity, prioritizing needs, and delivering results in a dynamic
environment
## Job 4 [Computer Vision Data Scientist](https://www.amazon.jobs/en/jobs/1520242/computer-vision-data-scientist)
**Employer:** Amazon
**Team:** AWS Cleared Jobs
**Location:** US, VA
### Required:
* Master or PhD in computer vision/machine learning or related experience.
* 3+ years of relevant experience in building production-scale system/algorithm in one of the following domains:
computer vision, deep learning, or machine learning.
* Coding skills in one or more programming languages such as Python, Scala, Java, C, C+
* 2-3 years of modeling experience working with deep learning frameworks like Pytorch or MxNet.
* Current hands-on experience with state-of-the-art object detection approaches (e.g. Faster RCNN, YOLO, CenterNet etc.)
* Understanding of deep learning CV evaluation metrics including mAP, F_beta, PR curves, etc.
### Nice to have:
* Broad knowledge of fundamentals and state-of-the-art in computer vision/machine learning.
* Experience leveraging and augmenting large code base and computer vision/machine libraries/toolkits to deliver
new solutions.
* Experience extending object detection models to multi-object, multi-label tracking
* Experience working with geospatial datasets (e.g. satellite imagery)
* Experience working with motion imagery datasets (e.g. Full Motion Video/ FMV, Wide Area Motion Imagery/ WAMI)
* Proven track record of innovation in creating novel algorithms and advancing the state of the art
* Distributed training experience (DDP, Horovod)
* Model compilation experience (TensorRT, TVM)
* Familiarity deploying solutions to AWS or cloud services and experience with AWS services such as SageMaker is considered a plus
* Familiarity deploying solutions to IoT/edge platforms (e.g. NVIDIA Jetson Xavier)
* **Experience in publishing at major computer science conferences or journals**
* **Proven track record in technically leading and mentoring scientists**
* **Strong written and verbal communication skills and ability to work effectively with a large, distributed team.**
## Job 5 [Data Scientist - AWS Infrastructure](https://www.amazon.jobs/en/jobs/1587231/data-scientist-aws-infrastructure)
**Employer:** Amazon
**Team:** AWS Data Science
**Location:** Arlington Area, VA
### Required:
* Advanced degree (M.S. or Ph.D.) in Engineering, Math, Statistics, Finance, Computer Science, or related
industry experience.
* 3+ Years of experience in data science/analysis/engineering
* 2+ Years of experience applying Statistics/Data Science/Machine Learning
* 2+ Years of Scripting experience in Python/R or other scripting languages
* 2+ Years of SQL experience
* **2+ Years of experience in Data Visualization, using Tableau, R Shiny, other off the shelf products,
or scripting directly**
### Nice to have:
* Experience in modeling and optimization
* Working knowledge of AWS tech stack.
* Experience with clustered data processing (e.g. Hadoop, Spark, Map-reduce, Hive)
* **Experience in communicating technically, at a level appropriate for the audience.**
## Job 6 [Language Engineer](https://www.amazon.jobs/en/jobs/1603588/language-engineer)
**Employer:** Amazon
**Team:** Alexa Speech
**Location:** US, CA
### Required:
* Knowledge of scripting languages (e.g. Python, bash)
* Knowledge of phonetics/phonology and ability to analyze/validate phonetic transcriptions
* Native or near-native fluency in a non-English language
* **Excellent written and spoken communication skills**
### Nice to have:
* Masters in Computational Linguistics (or equivalent field with computational emphasis); alternatively,
2 years of experience in the field.
* Hands-on experience working with Natural Language Processing or Speech Processing
* Experience in writing grammars and building FSTs
* Strong personal interest in learning, researching, and creating new technologies related to foreign languages,
linguistics, phonetics, phonology and language technology
* Feeling comfortable and motivated when working in a fast paced, highly collaborative, dynamic work environment
## Required skills summary:
| Offer id | MSc | PhD | Communication | Visualization | AWS | Big Data | SQL | ML tools |
|----------|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|-------------------:|
| 1 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |
| 2 | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| 3 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :heavy_check_mark: |
| 4 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
| 5 | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
| 6 | :heavy_check_mark: | :x: | :heavy_check_mark: | :x: | :x: | :x: | :x: | :x: |