Technion ECE DL Course
The course is given in the Electrical and Computer Engineering Faculty at the Technion (ECE 046211) and covers several topics in deep learning (DL), with emphasis on supervised approaches to DL. Please visit our GitHub for the specific agenda and topics.
Project Title | Students | TL;DR | GitHub/YouTube |
---|---|---|---|
Model Compression and LoRA | Ido Blayberg & Ohad Amsalem | We tested the effectiveness of adding LoRA layer to a pre-trained compressed model for image classification task | GitHub Link YouTube Link |
Suicidal Prediction Transformer | Uri Koren & Daniel Ochana | Predicting the need for suicide watch based on text, using transfomers (from scratch, GPT2, ChatGPT) | GitHub Link YouTube Link |
Facial Emotion Detection Using GAN-generated Emotion Augmentations | Reuven Smbatyan & Amit Shalev | Leveraging EmotionGAN to enhance training process by creating emotion augmentations and expanding/balancing our dataset | GitHub Link |
Song Sentiment Analysis | Zohar Milman & Dor Aviv | Fine-tuning of BERT model for the purposes of song lyrics sentiment analysis using LoRA and hyper-parameter sweep | GitHub Link YouTube Link |
Predicting A Song’s Popularity Using Mamba | Itamar Horowitz & Rebecca Azoulay | Implementing a Mamba architecture in order to predict the popularity of a song | GitHub Link YouTube Link |
DSP in Deep Learning – Morse Code Detector + Decoder | Maor Assayag & Eliraz Kadosh & Eliram Amrusi | We show that LSTM-RNN decodes Morse code at -5dB SNR with 2% CER, and Faster-RCNN (RESNET) achieves 98% IoU-x accuracy for Morse code detection | GitHub Link YouTube Link |
Earthquake Prediction | Tzvi Tal Noy & Daniel Levi | Forecasting seismic activity, aiding in disaster management and mitigation efforts using RWKV | GitHub Link |
Age Estimation from Chest X-ray | Iggy Segev Gal & Eyal Gilron | Experimenting with different transfer learning methods to estimate the age of a person from his chest Xray image | GitHub Link |
Sherleaf-Holmes - Plant Species and Disease Type Classifier | Lee Benyamin & Ben Segal | Exploring different typs of architectures using transfer learning on image classification task - VGG, Alexnet, Resnet, Densenet, ViT + Lora & Dora | GitHub Link YouTube Link |
GPT2-based sEMG Gesture Classifier | Yuval Gerzon & Stav Belyy | Transfer learning to classify EMG movement, overcoming small amount of labeled data, by training a GPT2 transformer-decoder model | GitHub Link |
Music Genre Classification with Transformers | Yuval Hoffman & Roee Hadar | Using Transformers to classify music generes | GitHub Link YouTube Link |
ViT vs. CNN for Elephants Classification | Ariel Lulinsky & Hadar Hai | Evaluating Convolutional Neural Networks (CNN) and Vision Transformers (ViT) for distinguishing between Asian elephants and African elephants | GitHub Link YouTube Link |
Learning to Play 20 Questions | Yotam Norman & Shir Rotman | Leveraging pre-existing LLM capabilities for training a chatbot to play 20 questions | GitHub Link |
This Image Does Not Exist… Or Is It? | Almog Anschel & Ran Elbaz | Using ViTs and Transfer Learning to evaluate if an image is generate by AI or not | GitHub Link |
Basketball Action Recognition | Tal Dugma & Yonatan Ashlag & Yarin Bekor | Real-time recognition of basketball players and their actions on court | GitHub Link YouTube Link |
Chaotic Time-Series Prediction using RNNs | Shaked Leslau & Rishona Daniels | Lorenz ‘63 chaotic time-series prediction using LSTM, GRU, Tranformers, and Reservor computing | GitHub Link YouTube Link |
Project Title | Students | TL;DR | GitHub/YouTube |
---|---|---|---|
Cartoonify- Cartonn Augmentation | Stav Lotan & Harel Mendelman | Testing the efficacy of “Cartoon Augmentations” for image classification | GitHub Link YouTube Link |
MemeSense | Tomer Keniagin & Aviv Shem-Tov | Identifying hateful content in memes using combined text and image analysis | GitHub Link |
Music Genre Classifier | Amit Karp & Ilay Yavlovich | Classifying songs into their respective genres using advanced models | GitHub Link YouTube Link |
MRI Brain Tumor Patch-Based Classification | Nitzan Alt & Or Ronai | Detecting brain tumor types using MRI scans and transfer learning | GitHub Link YouTube Link |
How I Feel | Itamar Nierenberg & Nir Elfasi | Recognizing human emotions from facial expressions in images | GitHub Link YouTube Link |
RWKV-Based Music Generator | Ariel Suller & Liad Perl | Modifying LSTM-based music generation with an RWKV model | GitHub Link YouTube Link |
Uncertainty Score for Medical Segmentation Models | Daniel Katz & Natalie Mendelson | Evaluating segmentation model performance with uncertainty scores | GitHub Link YouTube Link |
ASL Translation from Images to Letters | Hadar Shloosh & Hadas Manor | Translating American Sign Language (ASL) images into English letters | GitHub Link |
FeelMyText : Emotion Classifier From Text Using Data Augmentations | Yossi Meshulam & Eran Yermiyahu | Classifying emotions within raw textual content with augmented data | GitHub Link YouTube Link |
Stock Price Prediction Using RWKV | Roee Latzres & Tomer Krichli | Forecasting stock prices using RWKV and numerical prediction | GitHub Link |
AI-based Position Estimation | Avichay Ashur & Itay Geva | Predicting device location based on radio frequency data | GitHub Link YouTube Link (HEB) YouTube Link (ENG) |
Facial Attribute Classification for Personalized Emoji Generation | Eitan Gorbonos & Amiel Gorbonos | Creating personalized emojis based on facial attributes | GitHub Link |
Music Genre Classification using Wav2Vec2 | Itai Allouche & Adam Katav | Improving music genre classification with sound data representation | GitHub Link |
ASLetter – Generating ASL Images From Letters | Nir Ben Haim & David Levit | Generating ASL images from English letters | GitHub Link |
Climbing Holds Classification | Ori Levi & Or Pomeranz | Accurate classification and segmentation of climbing holds | GitHub Link YouTube Link |
Enhancing Dense Crowd Counting CNNs | Rebecca Azoulay & Chris Shakkour | Improving the accuracy of crowd counting CNNs | GitHub Link YouTube Link |
Super Mario Play | Amir Mishael & Lavi Doron | Learn Super Mario Bros gameplay and predict next key press | GitHub Link |
Self-Learner - STL10 classification using BYOL representation | Tamar Sde Chen & Hadar Rosenber | Demonstrating an 11% improvement in supervised learning classification accuracy by leveraging self-supervised learning | GitHub Link YouTube Link |
Image2Recipe: Deep Learning Network for Image-to-Recipe Translation | Sagi Eyal & Loren Tzveniashvily | Outputting recipes from food images using Deep Learning | GitHub Link |
Efficient Use of Visual Transformers | Matan Millionschik& Yael Zak | Enhancing visual transformers with quantization and early exit | GitHub Link |
Prism Diffusion | Ron Raphaeli & Sofiia Kurbatova | Controlling colors in diffusion-generated images using conditional LoRA | GitHub Link |
FSLC - Fingerspelling Sign Language Classification | Samer Khair & Alam Shomary | Classifying fingerspelling sign language using CNNs | GitHub Link YouTube Link |
Human Activity Recognition using Accelerometers data | Yair Stolero & Shlomi Buhadana | Comparing models for Human Activity Recognition from accelerometer data | GitHub Link |
Style is All You Need | Evgeniy Pukhov & Dolev Hagai | Image style transfer using CNNs and optimization techniques | GitHub Link YouTube Link |
BrainSight: Leveraging Vision Transformers for Tumor Detection | Gil Litvin, Ari Shemesh & Omer Paz | Achieving 99.61% accuracy in brain tumor classification using Vision Transformers | GitHub Link |
Book Review Score Predictor | David Cojocaru & Ori Hagler | Predicting book review scores using NLP | GitHub Link YouTube Link |
Image Captioning with LSTM&GRU | Meir Lederman & Shahar Alpert | Captioning images using Encoder-Decoder with LSTM and GRU | GitHub Link |
Human or GPT: 2023 Turing Test | Noam Kasten & Muhammed Abu el Hija | Classifying text as human-authored or generated by GPT | GitHub Link YouTube Link |
Project Title | Students | TL;DR | GitHub/YouTube |
---|---|---|---|
ViT-BYOL | Shahar Yadin & Noa Shoham | Easily training a classifier with robustness to noise, without the need for explicitly showing it noisy images during training. | GitHub Link YouTube Link |
Adversarial Attacks on Face Recognition Models | Shelly Francis & Gil Kapel | Defencing Adversarial Attacks on Face Recognition Models. | GitHub Link YouTube Link |
Audio-based Instrument Detection | Roy Steinberg & Rotem Elimelech | With careful pre-processing and choice of loss functions, we predict multiple instruments in a musical piece. | GitHub Link YouTube Link |
TransferMPL with Roleplaying | Shira Lifshitz & Saar Stern | We propose a “RolePlaying” mechanism and improve the “Meta pseudo Labels” method. | GitHub Link YouTube Link |
Diffusion-based Data Augmentations | Eran Avneri & Itay Lamprecht | We show that using generative data augmentations with diffusion models improve the performence of CNN models. | GitHub Link YouTube Link |
Black and White Image Colorization | Aviv Ish Shalom & Salomon Malka | We use a UNet architecture and test various loss functions for colorizing grayscale images. | GitHub Link YouTube Link |
CLIP Your Food | Elizabet Khaimov & Ori Zehngut | We extract list of ingredients from a given input dish image using CLIP features. | GitHub Link YouTube Link |
Limited-data Face Classification using Transfer Learning | Amit Kadosh & Batel Shuminov | Using a limited collection of face images we improve face recognition with transfer learning. | GitHub Link YouTube Link |
Weights Pruning of Resnet50 Model | Michael Berko & Naomi Shapiro | We’ve managed to reducre nearly 40% of model’s parameters without changing its accuracy. | GitHub Link YouTube Link |
Contrastive-center Loss for Speaker Recognition | Lior Bashari & Yonatan Kleerekoper | Examining the effect of Contrastive-center loss regularization on Speaker Recognition using transfer learning with ResNet-18. | GitHub Link YouTube Link |
Football Player Detection | Avishav Engle & Daniel Hassid | Fine-tuned YOLOv7 NN to localize and classify footballer players and trained a BYOL network to cluster the teams. | GitHub Link |
RNN and Transformer-based Image Captioning | Itai Shufaro & Nir Luria | We implemented an encoder-decoder network for image captioning. The encoder is a pre-trained CNN, and for the decoder we used both LSTM and Transformer networks. | GitHub Link YouTube Link |
Generating Captions for Visual Stimuli from fMRI Scans | Yoav Tsoran & Roey Shafran | We use a pretained fMRI encoder and GPT2 to generate a caption of the image presented to a subject during an fMRI scan. | GitHub Link> YouTube Link |
IMDB Score Prediction from a Movie’s Trailer | Eran Mann & Zeev Zukerman | Predicting IMDB score from selected frames from the movie’s trailer. | GitHub Link YouTube Link |
ECG Classification with CNNs | Sagie Badoach & Gilad Altshuler | We classify ECG heartbeat to different disease classes, with simpler yet accurate (+98%) CNN model, and adapted the model to 2 tasks. | GitHub Link YouTube Link |
Image Caption Generator | Dana Rip | We compared two architectures for image captioning: LSTM and Transformer. | GitHub Link YouTube Link |
Boosting Classification with Estimated Depth | Mousa Arraf & Chen Katz | We boosted the results of classification models by using estimated depth as additional features. | GitHub Link |
Augmenting Textual Datasets with GPT | Almog Zur & Ari Granevich | We used GPT2 to augment the IMDB dataset. | GitHub Link |
Project Title | Students | TL;DR | GitHub/YouTube |
---|---|---|---|
Adaptive STFT | Noam Elata & Rotem Idelson | Expanding on existing application of image processing networks to audio using STFT, we propose an adaptive STFT layer that learns the best DFT kernel and window for the application. | GitHub Link YouTube Link |
Meta-Learning with Non-uniform Task Weights | Lior Friedman & Yair Nahum | We examine the effects of various task orderings and selections on meta-learning accuracy for images. | GitHub Link YouTube Link |
Weapon Detection Using Transfer-Learning | Alon Nemirovsky & Itamar Ginsberg | We use YOLOv5 pre-trained model to detect threats in the form of pistols/knives using a dataset of life-like scene images. | GitHub Link YouTube Link |
Sentence Transformer-VAE | Nofit Segal & Dan Haramati | We built a sentence VAE using the Transformer encoder-decoder architecture presented in “Attention Is All You Need” by Vaswani, Ashish, et al. | GitHub Link |
EEG Classification | Nitzan Shitrit & Priel Salomon | We implement a CNN for classification of EEG signals recorded while a subject is viewing image of digits from 0 to 9 as stimuli. | GitHub Link YouTube Link |
Beyond Inpainting - Combining Semantic Segmentation with Inpainting | George Pisha & Yevgeniy Men | We’ve implemented an end2end image segmentation and inpainting pipeline for auto-editing of images and extend the framework to video automatic object removal. | GitHub Link |
Bitcoin Price Prediction Using Transformer | Yuval Baruch & Yuval Aidan | Predict Bitcoin short term price based on it’s former values using transformers architecture. | GitHub Link YouTube Link |
Bird Species Classification & Noise Robustness | Hodaya Rabinovich & Yael Ilan | We focused on several pre-trained classification architectures and tried to ‘attack’ them with noise, comparing the results. Finally, we tried to improve the result using augmentations. | GitHub Link |
Object Detection of Handwritten Circuit Diagrams | Dan Ilan Ben David & Adam Soker | Detecting and converting handwritten circuit diagrams to computer circuit diagram (LTspice). | GitHub Link YouTube Link |
Facial Expression Recognition with Attention Prior | Ido Terner & Yair Gat | We have explored using an attention prior for facial expression recongnition using an attention prior. | GitHub Link |
Malicious URL Detection | Edan Kinderman & Lev Panov | We detect Malicious URLs with LSTM and transformer architectures using character-level embedding methods. | GitHub Link |
Two (Gyros) is All You Need | Gal Ness & Elad Zohar | We use the XY timeseries of smartwear accelometers to predict the Z motion with Transformer and novel positional encodings. | GitHub Link |
Lips Don’t Lie | Tom Bekor & Mitchell Butovsky | Lip reading of a talking person without the use of audio, using facial-landmarks recognition and Transformer. | GitHub Link 1 GitHub Link 2 |
Face Aging with StyleGAN2-ADA | Moshe Rafaeli & Udi Gal | We have implemented a conditional image generation model to perform face aging with a StyleGAN2-ADA generator and n InceptionResnetV1 encoder. | GitHub Link |
Project Title | Students | TL;DR | GitHub/YouTube |
---|---|---|---|
Cassava Leaf Deasses Classification | Ido Gabay & Aviya Cohen | Classification of imbalanced dataset from Kaggle with several methods, using transfer learning and our own conv-net. | GitHub Link YouTube Link |
Traffic Signs Classification | Kfir Levi & Li-or Bar David | Comparing performances of different architectures on a traffic signs dataset. | GitHub Link YouTube Link |
Short Term Stock Price Prediction with a Simple Trading Bot | Orel Tsioni & Roee Ben Shlomo | Predicting a stock price 1 minute into the future and trying to achieve a profit using a bot. | GitHub Link |
Underwater Gesture Classification | Or De Goede & Matan Topel | Classifying gestures of divers in changing water conditions with 97.85% accuracy on the testset, using YOLOv5 for localization and our own CNN for classification. | GitHub Link |
Spoken Digit Generator | Aviv Eliav & Ran Ben-Shaul | Generating audio files of spoken digits, using a conditional generative architectures (cVAE, cGAN) and evaluating the results with Inception Score. | GitHub Link |
Using GAN generated images to improve classification small datasets | Daniel Bracha & Vladimir Kulikov | Using StyleGAN2-ADA to artificially generate more training images from the base dataset and comparing classification accuracy with and without the artificial data. | GitHub Link |
RMR prediction Using Body Composition and Metabolic Adaptation Features | Amir Kfir & Tomer Keren | Based on private-use medical dataset, we build a DL architecture to predict Metabolic Rate values on rest based on other body-features. We also compare our results with classic ML algorithms. | GitHub Link |
Sketch Classification | Roie Reshef & Gefen Dawidowicz | Sketch Classification on the “Quick, Draw!” dataset. Improves accuracy of an existing network by adding a special noise to the sketches called “shake pen”. | GitHub Link |
Triplet Network for Few Shot Learning | Daniel Teitelman & Gonen Weiss | Implementing the triplet network for metric learning than using the learned feature vectors for few shot learning on the FashionMNIST dataset. | GitHub Link YouTube Link |
Playing Chrome’s Dino Run with Deep Reinforcement Learning | Ari Frummer & Gal Kinberg | Comparing the performence of several DQN architectures and methods on the Dino Run game, including a classical DQN, a Dueling DQN, and using data augmentations. | GitHub Link |
COVID19 Detection from CT Scans | Lin Sinorodin | Compare the performance of CNN vs. Vision Transformer as feature extraction models on image classification task (using transfer learning). | GitHub Link |
Predicting the NASDAQ 100 using LSTM and GRU network | Roy Mahlab & Tamir Haver | Predicting the NASDAQ 100 index 1 week ahead using LSTM/GRU on the Federal Reserve total assets compared to a naive classifier. | GitHub Link |
Image Caption with Attention | Neria Uzan & Razi Facheldeen | Using CNN and RNN methods build a network that can describe an image content. | GitHub Link |
Music Genre Classifier | Omer Cohen & Jonathan Nir Shalit | Music genre classification using several Deep Learning architectures and methods. | GitHub Link YouTube Link |
Mutual Information In Stock Prediction | Aviv Ratzon & Chay Guez | Training Transformer to predict next day closing prices of various stocks, and then adding information about related stocks and measure change in performance. | GitHub Link YouTube Link |
Image Captioning | Luiz Wainstein & Yahel Kleinman | Using Encoder Decoder networks to generate descriptive sentences on image input. | GitHub Link |
Sentiment Analysis of Machine Summarized Text | Roy Elkabetz | Does google’s pre-trained T5 general NLP model preserve sentiment of text when used as a summarizer? Let’s find out. | GitHub Link |
Covid 19 Prediction from Chest X-Ray Images | Ariel Weizman & Yael Shavit | Using Transfer Learning from a model we pre-trained on detecting Pneumonia to the task of identifying COVID-19. | GitHub Link |
UnmaskMe - Unmasked Face Generation | Ron Vainshtein & Shay Shimonov | Generating unmasked face images from masked face images using pix2pix cGAN architecture. | GitHub Link YouTube Link |
For more information/questions, pleaes contact Tal Daniel.