Jhosimar Arias

Jhosimar Arias

Deep Learning Researcher

Biography

I am a part-time lecturer at Universidad Peruana de Ciencias Aplicadas and the founder of ML/DL Meetup AQP, a non-profit community of people interested in artificial intelligence. We organize study groups, symposiums, mentoring sessions, and meetups that emphasize the importance of AI research.

I received my Master's Degree in Computer Science at Institute of Computing , University of Campinas. My thesis focuses on the study of semi-supervised and unsupervised clustering based on deep generative models.

I graduated with a bachelor's degree in Systems Engineering from National University of Saint Agustine. During my undergraduate years I had the opportunity to participate in several competitions such as ACM/ICPC international programming contest representing my university and online competitions (TopCoder, HackerRank, LeetCode).

As part of my professional experience I worked as software engineer for zAgile, a startup located in San Francisco, integrating software technologies such as Salesforce, Confluence and Jira.

Interests

  • Deep Learning
  • Computer Vision
  • Probabilistic Machine Learning

Education

  • MSc. in Computer Science, 2018

    University of Campinas (Brazil)

  • BSc. in Systems Engineering, 2012

    National University of Saint Agustine (Peru)

News

Past News
  • [12/19] - Invited to give a talk about "The Power and Limits of Deep Learning" at "Chapter Week - Systems Engineering & Informatics" in Arequipa, Peru.
  • [10/19] - My work "Deep Clustering using MMD Variational Autoencoder and Traditional Clustering Algorithms" was accepted in the workshop of Sets & Partitions (NeurIPS), Vancouver, Canada.
  • [10/19] - My work "Semi-supervised Learning using Deep Generative Models and Auxiliary Tasks" was accepted in the 4th Bayesian Deep Learning workshop (NeurIPS), Vancouver, Canada.
  • [09/19] - Invited to give a talk about "Conferences and Opportunities in Artificial Intelligence" at "Artificial Intelligence Seminar" in Arequipa, Peru.
  • [06/19] - Attended the Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA.
  • [06/19] - Presented my work of semi-supervised classification as oral presentation and poster at the Latinx in AI Workshop (ICML), Long Beach, CA.
  • [06/19] - Invited to give a talk about "Foundations and Applications of Neural Networks" and to be part of the tech expo with my community at Xpotron 2019: Control of Dynamical and Aerospace Systems in Arequipa, Peru.
  • [05/19] - Invited to give a talk about "The Power and Limits of Deep Learning" and a practical session about "Unsupervised Learning" at Computer Day Puno 2019: Artificial Intelligence in Puno, Peru. [slides] [code]
  • [02/19] - Invited to give a talk about Deep Learning at the Machine Learning Seminar in Cuzco, Peru. [slides]
  • [01/19] - Presented my work of unsupervised clustering at the "First Peruvian Symposium on Deep Learning" in Arequipa, Peru. [slides]
  • [01/19] - Presented a tutorial about the Foundations of Unsupervised Deep Learning at the "First Peruvian Symposium on Deep Learning" in Arequipa, Peru. [slides]
  • [01/19] - Lead organizer of the First Peruvian Symposium on Deep Learning in Arequipa, Peru.
  • [01/19] - Presented my work of semi-supervised clustering at the X Peruvian Symposium on Artificial Intelligence in Arequipa, Peru. [slides]
  • [05/18] - Founded the community ML/DL Meetup AQP in Arequipa, Peru.
  • [02/18] - Defended my master's thesis at UNICAMP in Campinas, Brazil. [slides]
  • [01/18] - Invited to give a talk about my work presented at NIPS. International Conference on Computer Research in Arequipa, Peru. [slides]

Publications

Deep Clustering using MMD Variational Autoencoder and Traditional Clustering Algorithms

Sets & Partitions Workshop (NeurIPS 2019)

Semi-supervised Learning using Deep Generative Models and Auxiliary Tasks

4th workshop on Bayesian Deep Learning (NeurIPS 2019)

Deep Generative Models for Clustering: A Semi-supervised and Unsupervised Approach

Master Thesis

Is Simple Better?: Revisiting Simple Generative Models for Unsupervised Clustering

Second workshop on Bayesian Deep Learning (NIPS 2017)

Learning to Cluster with Auxiliary Tasks: A Semi-Supervised Approach

30th Conference on Graphics, Patterns and Images (SIBGRAPI 2017)

Projects

Few-Shot Image Classification

Implemented the few-shot learning assigments of the CS330 course offered by Stanford University, which include MANN, MAML and ProtoNet. Worked with Python and Tensorflow.

GMVAE for clustering

In this project, I implemented a Gaussian Mixture Variational Autoencoder by representing the Categorical latent variable with the Gumbel-Softmax distribution avoiding the problem of multiple gradient estimators used in marginalization. Experiments showed around ~80% of clustering accuracy with multilayer perceptrons. Worked with PyTorch and Tensorflow.

CS231n: Convolutional Neural Networks for Visual Recognition

Implemented the assignments given by the CS231n course offered by Stanford University, which covers different machine learning topics including image classifiers (kNN, SVM, Softmax), CNNs, RNNs, LSTMs and GANs. Worked with python and Tensorflow.

Trademark Image Retrieval using Deep Feature Maps

In this work, I present a study of transfer learning applied to trademark image retrieval. Initially selective search is used to obtain region proposals, which are forwarded through a pretrained CNN architecture (AlexNet, GoogleNet and ResNet) on the ImageNet dataset. Feature representations are improved by developing feature aggregation methods (avg-pool, max-pool and R-MAC) over intermediate layers. Finally re-ranking based on graph query specific fusion algorithm was applied to improve the results. Experiments demostrate that intermediate layers produce better results for image retrieval. It was possible to increase in ~15% the baseline (features of last layers) mean average precision (mAP). Worked with python and Caffe.

Leadership

ML/DL Meetup AQP is a Non-Profit community of people interested in Artificial Intelligence (AI), particularly in Machine Learning (ML) and Deep Learning (DL). We organize meetups in person and online reviewing books, lectures, research papers, and courses from top Universities and MOOC's. It's an open community to people of all levels of knowledge.

Among the activities we have organized:

Talks

Introduction to Deep Learning

I Training Course in Computer Science for the Programming and Development of Multiplatform Systems

Limitations and New Frontiers of AI

Artificial Intelligence: a scenario for Digital and Technological Transformation (Virtual Webinar)

Markov Decision Processes

Reinforcement Learning Study Group

Hyperparameter Tuning, Batch Normalization and Multiclass Classification

Deep Learning Study Group

Deep Learning: Applications, Challenges and Opportunities

Information technology trends and best practices for e-learning and business continuity in times of crisis (Virtual Webinar)

Competitive Programming

Oral Presentation in LXAI at ICML 2019

LatinX in AI Workshop at ICML 2019

Teaching

2022-2

  • CC126 - Introduction to Algorithms, Universidad Peruana de Ciencias Aplicadas (UPC), Peru
  • CC227 - Introduction to Deep Learning, Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2022-1

  • CC100 - Programming I, Universidad Peruana de Ciencias Aplicadas (UPC), Peru
  • CC68 - Algorithms and Data Structures, Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2021

  • CC199 - Emerging Topics in Technology (Introduction to Deep Learning), Universidad Peruana de Ciencias Aplicadas (UPC), Peru

2020

  • Introduction to Machine Learning, Short Course, La Salle University, Peru
  • Machine Learning (Co-Instructor), Data Science Certificate Program, La Salle University

2016

Blog

In 2012, I started a blog in Spanish on algorithms and programming in order to help people better understand the algorithms and data structures used in competitive programming and programming projects. Although I am not very active on this blog, the algorithm explanations I posted are very useful nowadays (more than ~12000 views per month).

The explanations are given step by step with graphical examples, code and exercises. Some of the most interesting posts based on user statistics are: