Andreas Kefalas

Data Engineer

  • Programming & Tools: Python (pandas, NumPy, SciPy, Matplotlib, Seaborn), R (ggplot2), Bash scripting, SaaS Products, Redis
  • Data Manipulation, ETL and Visualization: Tableau, Power BI, Apache Superset, Apache Spark, SPSS, NetworkX, Grafana, Fivetran
  • Cloud & Databases: AWS (EC2, S3, Lambda, Lake Formation), Snowflake, SQL, PostgreSQL, AWS Redshift, Jenkins, Docker Containers, Cron, Kubernetes
  • Agile Environment: Pivotal Tracker, Jira, Slack, ClickUp
  • "Need it by 4 PM this Friday" last-resort skill: Friday 4 PM Taskmaster's Handbook
  • Data Engineer, Tru Solutions, Remote (May 2023 - Present)
    • Analyzed energy project data, optimized operations, and developed real-time visualization dashboards using industry-standard tools to enhance IT sector insights.
    • Automated internal metrics reporting, moving manual processes to cloud-based solutions to improve efficiency.
    • Worked with both SQL and NoSQL databases for data management.
    • Automated API requests using Python scripts and Jenkins, leveraging AWS Lake Formation to extract, store, and analyze large volumes of API data.
    • Implemented data engineering solutions for energy projects, orchestrating the integration of multiple APIs using Fivetran to ingest data into Snowflake.
    • Improved data reporting accuracy for top management by 30%.
    • Conducted data analysis for the oil rig and natural gas midstream pipeline industry using PostgreSQL.
    • Utilized AWS Lake Formation for scalable data extraction, storage, and analysis, and configured AWS EC2 instances to host and run a full BI tool stack.
    • Led the creation of custom productivity dashboards, optimizing operations at an energy facility by implementing Materialized Views and caching in Snowflake to reduce latency and warehouse costs.
  • Technical Project Manager, Tru Solutions, Remote (Aug 2022 - May 2023)
    • Managed projects and devised blueprints by collaborating with cross-functional teams, including Client Services and Product Development, to define project scope, objectives, and deliverables.
    • Engineered a predictive churn model resulting in a 25% decrease in customer attrition and optimized data pipelines, reducing data processing time by 30% and contributing to quicker product updates.
    • Consolidated all-in-one features such as device monitoring, ticketing, reporting, cloud backup, and remote control.
    • Reduced service calls by 25% and on-site visits by 35% within the first 2 months.
    • Drafted wireframes, wrote automation tool requirements, and oversaw multiple projects and their pipelines, including ticket integration, QA testing, and proactive communication with top management.
  • Scientific Research Assistant, CUNY Queens College, Queens, NY (Aug 2021 - Jan 2022)
    • Processed and visualized large data volumes, established connections via NetworkX, automated tasks for a 40% reduction in manual work, and used Python and R for clustered environment visualizations and metadata analysis in scientific journals.
  • Quantitative Analysis in R Lecturer, CUNY Queens College, Queens, NY (Jan 2022 - Aug 2022)
    • Developed lab assignments, evaluated homework and exams, provided feedback, offered guidance on improving code-writing skills, and taught a class of 25 students, aiding them in grasping key concepts, R package use, and executing coursework effectively.
  • Junior Developer, TruQC, Remote (May 2017 - Aug 2020)
    • Designed and implemented dynamic application reports leveraging JavaScript and JSON schemas for enhanced functionality and user experience.
    • Led the management of Continuous Integration builds in Jenkins, keeping environments in sync through timely fix commits.
    • Crafted custom SQL queries to generate insightful reports, driving data-driven decision-making.
    • Worked in an Agile environment, using Jenkins for CI/CD and Pivotal Tracker for project tracking and ticket management.

Master of Arts (M.A.) in Data Analytics and Applied Social Research - GPA: 3.5/4.0 (Aug 2021 - May 2023)
Queens College, Queens, NY

Bachelor of Science (B.S.) in Information Systems - GPA: 3.8/4.0 (Aug 2013 - Dec 2017)
University of Missouri - Saint Louis, Saint Louis, MO

NYC OPEN DATA - 311 Pre/Post COVID-19 Service Request Analysis (Jan 2023 - May 2023)
Extracted and modeled data from a 9 million-row, 13-year-old dataset and analyzed it in R, including regression testing, correlation analysis, and A/B testing methodologies. Created visualizations using the mapPluto R library to understand service request patterns across New York City's five boroughs.

Contact and Socials:

  • Location: NYC
  • Phone: 929-569-9420
  • Email: andrekef@gmail.com
