Session 1:
Kaitlin Cort
> Software Engineer (AI Platform) @ AidKit.org
> Background: data science for public policy, New York State Assembly
> New member: Tulsa Remote
Connect: kaitlin.cort@owasp.org
Large Language Models (LLMs): AI systems trained on massive text datasets to understand, summarize, generate, and predict human language.
Embeddings: Numerical representations of words in high-dimensional space.
Tokens: The basic units LLMs process (parts of words or characters).
Coordinates for words and concepts.
Imagine visualizing Alice and Bob as personalities in a 5-dimensional space.
Vector composition allows for computation.
OpenAI Embeddings: text-embedding-ada-002
Predict the next word (or pixel)
Generative AI: A subset of artificial intelligence that uses sophisticated algorithms and machine learning models to create novel content, such as text, images, videos, music, or voices.
Relationship between AI disciplines and where Generative AI fits
Timeline of AI research and development
Computational Power: Hardware acceleration - Graphical Processing Units (GPU) & Tensor Processing Units (TPU) makes training very large models feasible.
Availability of Data: Decades of "big data" from social media and web allows models to learn patterns and relationships.
Deep Learning Renaissance: Transformer architecutre (2017) attention mechanisms allow models to focus on relevant parts of input data, for scalable and efficient ways to handle sequences.
Bigger is in fact, better.
✈️ Aerospace & Defense: Advanced manufacturing, predictive maintenance, autonomous systems, supply chain
⛽ Energy: AI for oil & gas optimization, renewable forecasting, smart grid
🏭 Real Estate & Retail: Customer service, fraud detection, smart building management
🚚 Logistics: Route optimization for central U.S. hub, warehouse automation, data centers
🏛️ GovTech & Philanthropy: Service modernization, data-driven policy, community engagement, participatory AI governance
AI is projected to create $15.7 trillion in global economic value by 2030 (PwC)
🤖 AI Integration: Developing architectures that effectively combine traditional code with AI capabilities while maintaining reliability.
💡 Product-Focused Engineering: Understanding user needs and business value becomes critical as fewer engineers are needed to build a product.
🔄 Adaptive System Design: Creating flexible architectures that can evolve with rapidly improving AI capabilities and changing user expectations.
🛡️ Evaluation & Guardrails: Creating robust testing frameworks and safety measures for systems that may exhibit unpredictable behaviors.
📊 Data Engineering: Growing emphasis on high-quality data pipelines, synthetic data generation, and efficient vector storage solutions.
Software engineers are becoming AI orchestrators and product strategists, rather than just code developers.
The AI Development Lifecycle
Number One challenge...
We'll deploy SmolLm2 and Qwen2.5, small but powerful LLMs.
Using: