profile

Shiyun Huang

Senior Memeber of Technical Staff at Oracle

Developer of Oracle Database Appliance

Education

  • Carnegie Mellon UniversityAug 2016 - Aug 2017
  • Master of Information Technology-eBusiness Technology
  • GPA: 3.88/4.0
  • Courses: Cloud Computing, Java for Application Programmer, J2EE Web Application Development
  • Extracurricular activity: Piano player for Carnegie Mellon University School of Computer Science Musical Group

  • University of Science and Technology Beijing Sept 2012 - July 2016
  • Bachelor of Accounting
  • GPA: 3.67/4.0
  • Courses: Macro-Micro Economics, Financial Management, Advanced Accounting, Advanced Mathmetics
  • Extracurricular activity: Player of university swimming team, badminton team and tennis team.

My Skills

  • Java and J2EE
  • HTML, CSS and JavaScript
  • MySQL and JDBC
  • HBase, MongoDB
  • Linux and Ubuntu
  • MapReduce and Hadoop
  • AWS, Azure, GCP

Internship


Product Manager Assistant Intern March - Aug 2016

Developed a web application in Java to demonstrate the middleware interactions with the web service interfaces and keep track of system status.

Generated meeting reports for each 3-week Sprint under Agile software development environment.



Projects



RESTful Web Services for Twitter Data Analysis Feb - May 2017

Data pre-process: configured and setup MapReduce job flow on AWS Elastic MapReduce (EMR) to process 1TB JSON format tweets using Extract, Transform and Load (ETL).

Backend: handled concurrency, optimized schema design and cache mechanism for efficient read/write/reset/delete queries for MySQL and HBase.

Front end: built 2 RESTful web applications with Undertow and Jersey, deployed them on m-family instances and distributed traffic with Load Balancers, reaching RPS of over 10000.








Auto Scaling and Horizontal Scaling Project Feb 2017

Configured auto scaling policies for scale-in and scale-out; deployed an Auto Scaling Group for dynamically handling changing loads on the server and an Elastic Load Balancer for redirecting traffic to healthy Amazon EC2 instances.

Invoked AWS, Azure, Google Cloud Platform API to develop 3 horizontal scaling applications which will satisfy the performance requirement of handling 4000 requests per second.






Big Data Analysis based on Amazon AWSFeb 2017

Conducted parallel analysis and sequential analysis using MapReduce on over 128G Wikipedia page view data to find trending topics near the US Election Day.

Designed algorithms for data filtering, matching and sorting and processed JSON data using Java. Configured and setup MapReduce job flow on AWS Elastic MapReduce (EMR) and launched an Ubuntu instance on Amazon EC2 for result display on a web UI.





Input Text Predictor with NoSQL Databases April 2017

Generated n-gram and language probabilistic models using Hadoop MapReduce on 500GB XML dataset and stored in HBase and Redis.

Responded frontend input phrase with sorted prediction words in probability descending order.







Yelp Database Analysis with MySQL and HBase Feb 2017

Created the HBase Cluster with AWS EMR and to load Yelp's data files into HBase.

Used HBase Java API to programmatically connect and query Hbase.

Used the JDBC driver to programmatically connect and execute queries on MySQL.











Requirement Elicitation and Software Design for Parking Management System Nov-Dec 2016

Documented a requirement specification by analyzing the raw requirements and classifying them into functional requirements, business/technical constraints, and quality attributes requirements.

Designed the architecture the system from static, dynamic, and physical perspectives.

Utilizing UML diagrams to document detailed design and database design including package diagram, sequential diagram, and E-R diagram.







Ubiquitous Computing: Hospital 4.0 Solution for UPMC Sept 2016

Utilized Internet of Things (IoT) to design four integrated intelligent healthcare systems with four sub-systems including food service, cost accounting, emergency response, and surgical safety.

Presented the above solutions with a 10-minute PowerPoint demo.