Skills
Data Engineering/Big Data: Proficient in Spark (Scala/PySpark), Flink, Hive, Airflow, EMR, Kafka, Apache Hive, Delta Lake, Athena, Redshift, Jenkins, CloudFormation, and Databricks.
Machine Learning/AI: Skilled in developing Chatbots, deep-learning networks (LSTM, RNN, Transformers etc.), and information retrieval. Experienced in creating models for anomaly detection (monitoring real-time databases/Kafka), real-time traffic prediction, entity resolution in social networks, and prediction/classification tasks. Additionally, adept at building innovative data visualization dashboards and ML/DL productization using Python, Scala/Java, Matlab, and R.
Full-Stack Web Technologies: Proficient in full-stack web development, incorporating components built with Machine Learning, NLP techniques, Ontology, and Graph/No-SQL/SQL database technologies. Utilizes Node.js, React, Flask (Python), Play framework (Scala + Akka), and Spring (Java/J2EE) in the cloud (AWS and Azure) with Docker/Kubernetes. Familiar with Bootstrap4, CSS, jQuery/Ajax.
Databases: Utilizes graph databases (Neo4J), NoSQL (Firebase, HBase, Cassandra, and MongoDB), and relational databases (HiveQL, Redshift, PostgreSQL, MS SQL, Oracle, and MySQL) systems.
Mobile Dev: Proficient in React Native and Ionic Framework.
Cloud: Experienced with Amazon AWS and Azure.
Research: During my master's at Tsinghua University (Beijing), I focused on entity resolution problems from 2011 to 2013. Subsequently, I served as a Research Assistant in the Knowledge Discovery Lab at Tennessee Tech University during my Ph.D. from 2014 to 2017. My research primarily centers on News Mining, Graph Mining, Anomaly Detection, Text Mining, and NLP.