Learn About Me
For most uptodate information about me, please visit my LinkedIn Page
I am a Ph.D. candidate in computer science under the supervision of Prof. Bhaskar Krishnamachari, and I expect to graduate in May 2024. My research interests include machine learning, data-driven algorithms, anomaly detection, and edge computing. As a research assistant at the ANRG lab of USC, I work on developing and evaluating machine learning models for detecting and analyzing various attacks on IoT networks, such as DDoS and Sybil attacks. I have over four years of experience in this field, and I have published several papers in top-tier conferences and journals.
Previously, I was a software engineer intern at Google, where I worked on designing NLP/ML based systems to automatically generate ads and campaigns. I developed and integrated a prompt-engineered 24B LLM in production for generating search ads keywords tailored to advertisers' queries, resulting in high-quality and relevant keywords. I also designed and evaluated a prompt-engineered and fine-tuned 24B LLM on the ads help articles for addressing advertisers' general questions, achieving high relevance in addressing advertiser inquiries.
I am passionate about applying machine learning to solve real-world problems. I am always eager to learn new technologies and techniques, and to collaborate with other engineers and researchers. I am looking for opportunities to further advance my skills and knowledge, and to contribute to cutting-edge projects in these fields.
Education
Work Experience
• Developed and integrated a prompt-engineered 24B LLM in production for generating search ads keywords tailored to advertisers' queries, resulting in ∼90% relevance and high-quality keywords suitable for production use.
• Designed and evaluated a prompt-engineered and fine-tuned 24B LLM on the ads help articles for addressing advertisers' general questions; adopted the fine-tuned model, achieving ∼90% relevance in addressing advertiser inquiries.
• Implemented workflows/dashboard for continuous evaluation of non-EN text ads quality.
• Improved quality of automatically generated text ads through heuristic and ML modeling solutions:
- Optimized title-based headline extractor filters, increasing the selection of high-quality headlines by ∼16%.
- Improved mismatched language filter model accuracy by ∼93%, significantly reducing text ads rejections.
- Implemented an ML workflow to aggregate and analyze data for enhancing the grammaticality model of non-EN text ads.
Research Experience
• Distributed Denial of Service (DDoS) Attack Analysis:
- Synthesized and enhanced a real IoT time-series dataset with 350M samples, using real IoT network traffic distribution.
- Developed tunable DDoS attacks on IoT nodes with different start times, duration, and traffic patterns.
- Designed DDoS detection models using MLP, CNN, LSTM, AEN, and GCN with F1-score up to 91%
- Designed and evaluated both prompt-engineered and fine-tuned GPT-3.5-turbo for DDoS detection reasoning.
• Sybil Attack Analysis:
- Preprocessed and feature engineered a mobile encounter dataset with more than 3M samples.
- Designed Sybil attackers mimicking the behavior of real nodes using WGAN model
- Developed a GCN-based detection model with 91% F1-score, outperforming traditional MLP models by 10%
• Living Off The Land Binary (LOLBin) Malware Analysis:
- Implemented FastText, BERT, RoBERTa, and SBERT embedding for Unix command representation.
- Designed hybrid LOLBin detection/reasoning mechanism using LSTM/GPT-3.5-turbo with 94% F1-score
• University Digital Twin (Gemini):
- Created a unique students’ course registration dataset with more than 200K samples.
- Modeled epidemic spread on a school campus and developed simulations to analyze the impact of different policies.
- Designed and developed a greedy course scheduling algorithm, improved building entrance delay up to 3×.
- Led a group of two students developing a real-time simulation web app dashboard
• Administrating the ANRG Lab Linux servers
Worked on Mobile Cloud Computing and Applied Machine Learning. Specifically, I’m working on Markov Chain, Markov Decision Process, Dynamic Programming, Machine Learning, Hidden Markov Model, Classification, Reinforcement Learning.
Worked on software-defined networks (SDN). I used the Mininet network emulator to create a virtual SDN network for OpenDaylight (ODL) to control. I developed a topology which had several ODL controllers and could control the TCP/UDP traffic generated by the nodes in the network. I also worked on the Virtual Machines and tried to make a connection between them with a distributed virtual multilayer switch called “OpenVSwitch.”