Jhosimar Arias

Jhosimar Arias

Deep Learning Researcher, Machine Learning Developer, Private Tutor

Biography

I am a freelance ML developer and a private tutor at Superprof and tusclasesparticulares mentoring undergraduate and master’s students from various countries and universities. Previously, I served as a lecturer at Universidad Peruana de Ciencias Aplicadas and I founded of ML/DL Meetup AQP, a non-profit community of people interested in artificial intelligence.

I received my Master's Degree in Computer Science at Institute of Computing , University of Campinas. My thesis focuses on the study of semi-supervised and unsupervised clustering based on deep generative models.

I graduated with a bachelor's degree in Systems Engineering from National University of Saint Agustine. During my undergraduate years I had the opportunity to participate in several competitions such as ACM/ICPC international programming contest representing my university and online competitions (TopCoder, HackerRank, LeetCode).

As part of my professional experience I worked as software engineer for zAgile, a startup located in San Francisco, integrating software technologies such as Salesforce, Confluence and Jira.

Interests

  • Deep Learning
  • Computer Vision
  • Probabilistic Machine Learning
  • Natural Language Processing

Education

  • MSc. in Computer Science, 2018

    University of Campinas (Brazil)

    GPA: 4.0/4.0

  • BSc. in Systems Engineering, 2012

    National University of Saint Agustine (Peru)

News

Past News

Publications

Deep Clustering using MMD Variational Autoencoder and Traditional Clustering Algorithms

Sets & Partitions Workshop (NeurIPS 2019)

Semi-supervised Learning using Deep Generative Models and Auxiliary Tasks

4th workshop on Bayesian Deep Learning (NeurIPS 2019)

Deep Generative Models for Clustering: A Semi-supervised and Unsupervised Approach

Master Thesis

Is Simple Better?: Revisiting Simple Generative Models for Unsupervised Clustering

Second workshop on Bayesian Deep Learning (NIPS 2017)

Learning to Cluster with Auxiliary Tasks: A Semi-Supervised Approach

30th Conference on Graphics, Patterns and Images (SIBGRAPI 2017)

Projects

COVID-19 lung lesion segmentation

This work addresses the challenge of semantic segmentation for COVID-19 CT lung lesions using three different models: U-Net, TransUNet, and Swin-Unet. These models were selected for their representation of pure CNNs, a combination of CNNs and Transformers, and pure Transformer architectures, respectively. The results show that all three models achieved an Intersection over Union (IoU) greater than 70% and a Dice coefficient exceeding 80%. Among them, TransUNet delivered the best performance, but with over 105M parameters. In contrast, U-Net, a simpler architecture, achieved similar results with significantly fewer parameters (30M), demonstrating that CNN-based architectures remain competitive with Transformers for semantic segmentation tasks.

Task-specific knowledge distillation of BERT

This project focused on task-specific knowledge distillation of BERT to smaller models, such as BiLSTM, 1D-CNNs, and small BERT (with fewer layers), for the classification of COVID-19 related tweets. The BERT-base model was first fine-tuned on the dataset, and then distilled to smaller models using soft-labels with KL-divergence of the predictions or MSE loss on the logits. The best performing model retained 94% of BERT's performance while reducing the number of parameters by 97%. Worked with PyTorch and Hugging face

GMVAE for clustering

In this project, I implemented a Gaussian Mixture Variational Autoencoder by representing the Categorical latent variable with the Gumbel-Softmax distribution avoiding the problem of multiple gradient estimators used in marginalization. Experiments showed around ~80% of clustering accuracy with multilayer perceptrons. Worked with PyTorch and Tensorflow.

CS231n: Convolutional Neural Networks for Visual Recognition

Implemented the assignments given by the CS231n course offered by Stanford University, which covers different machine learning topics including image classifiers (kNN, SVM, Softmax), CNNs, RNNs, LSTMs and GANs. Worked with python and Tensorflow.

Trademark Image Retrieval using Deep Feature Maps

In this work, I present a study of transfer learning applied to trademark image retrieval. Initially selective search is used to obtain region proposals, which are forwarded through a pretrained CNN architecture (AlexNet, GoogleNet and ResNet) on the ImageNet dataset. Feature representations are improved by developing feature aggregation methods (avg-pool, max-pool and R-MAC) over intermediate layers. Finally re-ranking based on graph query specific fusion algorithm was applied to improve the results. Experiments demostrate that intermediate layers produce better results for image retrieval. It was possible to increase in ~15% the baseline (features of last layers) mean average precision (mAP). Worked with python and Caffe.

Leadership

ML/DL Meetup AQP is a Non-Profit community of people interested in Artificial Intelligence (AI), particularly in Machine Learning (ML) and Deep Learning (DL). We organize meetups in person and online reviewing books, lectures, research papers, and courses from top Universities and MOOC's. It's an open community to people of all levels of knowledge.

Among the activities we have organized:

Talks

Introduction to Deep Learning

I Training Course in Computer Science for the Programming and Development of Multiplatform Systems

Limitations and New Frontiers of AI

Artificial Intelligence: a scenario for Digital and Technological Transformation (Virtual Webinar)

Markov Decision Processes

Reinforcement Learning Study Group

Hyperparameter Tuning, Batch Normalization and Multiclass Classification

Deep Learning Study Group

Deep Learning: Applications, Challenges and Opportunities

Information technology trends and best practices for e-learning and business continuity in times of crisis (Virtual Webinar)

Competitive Programming

Oral Presentation in LXAI at ICML 2019

LatinX in AI Workshop at ICML 2019

Teaching

2023-1

  • CC227 - Introduction to Deep Learning, Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2022-2

  • CC126 - Introduction to Algorithms, Universidad Peruana de Ciencias Aplicadas (UPC), Peru
  • CC227 - Introduction to Deep Learning, Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2022-1

  • CC100 - Programming I, Universidad Peruana de Ciencias Aplicadas (UPC), Peru
  • CC68 - Algorithms and Data Structures, Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2021

  • CC199 - Emerging Topics in Technology (Introduction to Deep Learning), Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2020

  • Introduction to Machine Learning, Short Course, La Salle University, Peru
  • Machine Learning (Co-Instructor), Data Science Certificate Program, La Salle University

2016

Blog

In 2012, I started a blog in Spanish on algorithms and programming in order to help people better understand the algorithms and data structures used in competitive programming and programming projects. Although I am not very active on this blog, the algorithm explanations I posted are very useful nowadays (more than ~12000 views per month).

The explanations are given step by step with graphical examples, code and exercises. Some of the most interesting posts based on user statistics are: