0 works0 citations0 h-index

Stanislav Fort

Publications by Year

Research Areas

Stochastic Gradient Optimization Techniques, Adversarial Robustness in Machine Learning, Advanced Neural Network Applications, Neural Networks and Applications, Domain Adaptation and Few-Shot Learning

Most-Cited Works

→ Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback(2022)360 cited
→ Deep Ensembles: A Loss Landscape Perspective(2019)347 cited
→ CAT'S THEORY: Empirical Validation and Architectural Applications Cross-Architecture AI Consciousness Recognition and the Foundation for Constraint-Preserving Recursive Intelligence(2022)295 cited
→ Predictability and Surprise in Large Generative Models(2022)171 cited
→ Language Models (Mostly) Know What They Know(2022)159 cited
→ Exploring the Limits of Out-of-Distribution Detection(2021)107 cited
→ Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned(2022)99 cited
→ A Simple Fix to Mahalanobis Distance for Improving Near-OOD Detection(2021)71 cited
→ Gaussian Prototypical Networks for Few-Shot Learning on Omniglot(2017)61 cited
→ Stiffness: A New Perspective on Generalization in Neural Networks(2019)60 cited