What is AI ?
AI, or Artificial Intelligence, is the ability of a computer or machine to perform tasks that normally require human intelligence, such as learning, problem-solving, and decision-making.It uses technologies like machine learning to analyze data, recognize patterns, and act intelligently to achieve specific goals without being explicitly programmed for every single task
Examples:
chatbots, self-driving cars, facial recognition, and smart home assistants(SIRI , Alexa)
Approaches to AI ?
Symbolic/Rule-Based: Traditional coding practices (Expert systems, decision trees, rule engines, logic programming.) -- Legacy
Machine Learning: AI that learn from data,identifying patterns and relationships to make predictions or decisions without rely on rules (like Linear regression, neural networks, support vector machines, random forests.)
Evolutionary: Genetic algorithms, evolutionary strategies, ant colony optimization, particle swarm optimization.
Hybrid: Combining symbolic reasoning and neural networks like neuro-symbolic AI.
Supervised Learning vs Unsupervised Learning:
Supervised Learning:
You train the model with labeled examples - data where you already know the correct answer. The model learns to map inputs to outputs based on these examples.(train data with both i/p and o/p)
Common tasks:
- Classification (spam detection, image recognition)
- Regression (predicting house prices, stock prices)
Ex: Training a model to recognize cats in photos by showing it thousands of images labeled "cat" or "not cat" (we saw in overfitting ,underfitting)
Unsupervised Learning:
You train the model with unlabeled data - no correct answers provided. The model tries to find patterns, structure, or relationships in the data on its own.(train model only with input, then o/p will decided by ML model)
- Internally, it will create clusters(using clustering algorithms) based on input data and comparision will be done among clusters using Silhouette scores
Common tasks:
- Clustering (customer segmentation, grouping similar documents)
- Dimensionality reduction (data compression, visualization)
- Anomaly detection (fraud detection, network intrusion)
Example:
1) Giving a model customer purchase data and letting it discover natural groupings of similar customers, without telling it what those groups should be.
2) Targeting ads in instagram/facebook based on location, age,gender etc.,.where target(o/p) is keep on changing.
Summary:
Supervised learning generally gives more accurate, predictable results for specific tasks but requires the time and expense of labeling data. Unsupervised learning can discover hidden patterns you didn't know to look for, but the results can be harder to interpret and validate.
There's also semi-supervised learning (using a small amount of labeled data with lots of unlabeled data) and reinforcement learning (learning through trial and error with rewards), which combine aspects of both approaches.
What are Silhouette Scores?
Silhouette scores measure how similar an object is to its own cluster compared to other clusters. A high silhouette score(-1 to 1) indicates the object is well-matched to its own cluster and poorly matched to neighboring clusters.
Identifying Overfitting with Silhouette Scores:
If the training clusters have very high silhouette scores but these scores drastically drop when new data is clustered, it might indicate that the model has overfit to the training data and is not generalizing well to new samples.
Reinforcement Learning:
Key Components
- Agent: The learner/decision maker
- Environment: The world the agent interacts with
- State: Current situation of the agent
- Action: What the agent can do
- Reward: Feedback signal (positive or negative)
- Policy: Strategy the agent learns to maximize total rewards
The agent doesn't need labeled examples - it figures out the best strategy by experiencing consequences of its actions over time.
Ex: Game playing (like Chess) , Robots , Autonomous vehicles(tesla cars-self driving cars), recommendation systems (netflix , youtube suggesting ads, news feed learning) , finance & trading , healthcare(Personalized treatment recommendations that adapt based on patient responses), natural language (ChatGPT and similar models fine-tuned using RLHF (Reinforcement Learning from Human Feedback) to generate more helpful responses).
The key advantage is that RL can solve complex problems where the optimal solution isn't known in advance and must be discovered through interaction.
No comments:
Post a Comment
Please comment below to feedback or ask questions.