Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Felippe Schmoeller Roza, Simon Hadwiger, Ingo Thon, Karsten Roscher |
Towards Safety Assurance of Uncertainty-Aware Reinforcement Learning Agents. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Khondoker Murad Hossain, Tim Oates 0001 |
Backdoor Attack Detection in Computer Vision by Applying Matrix Factorization on the Weights of Deep Networks. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Saaduddin Mahmud, Sandhya Saisubramanian, Shlomo Zilberstein |
REVEALE: Reward Verification and Learning Using Explanations. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug |
Robustness with Black-Box Adversarial Attack using Reinforcement Learning. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Weimin Zhao, Sanaa A. Alwidian, Qusay H. Mahmoud |
Evaluation of GAN Architectures for Adversarial Robustness of Convolution Classifier. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Maryam Bagheri 0001, Josephine Lamp, Xugui Zhou, Lu Feng 0001, Homa Alemzadeh |
Towards Developing Safety Assurance Cases for Learning-Enabled Medical Cyber-Physical Systems. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Sumanta Dey, Pallab Dasgupta, Soumyajit Dey |
Safe Reinforcement Learning through Phasic Safety-Oriented Policy Optimization. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Axel Brando, Isabel Serra, Enrico Mezzetti, Francisco J. Cazorla, Jaume Abella 0001 |
Standardizing the Probabilistic Sources of Uncertainty for the sake of Safety Deep Learning. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Soumyadeep Pal, Ren Wang 0008, Yuguang Yao, Sijia Liu 0001 |
Towards Understanding How Self-training Tolerates Data Backdoor Poisoning. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Teddy Ferdinan, Jan Kocon |
Personalized Models Resistant to Malicious Attacks for Human-centered Trusted AI. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Peter Barnett, Rachel Freedman, Justin Svegliato, Stuart Russell 0001 |
Active Reward Learning from Multiple Teachers. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Valency Oscar Colaco, Simin Nadjm-Tehrani |
Formal Verification of Tree Ensembles against Real-World Composite Geometric Perturbations. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Stephen Casper, Dylan Hadfield-Menell, Gabriel Kreiman |
White-Box Adversarial Policies in Deep Reinforcement Learning. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Alberto Huertas Celdrán, Jan Kreischer, Melike Demirci, Joel Leupp, Pedro Miguel Sánchez Sánchez, Muriel Figueredo Franco, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller |
A Framework Quantifying Trustworthiness of Supervised Machine and Deep Learning Models. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Yize Li, Pu Zhao 0001, Xue Lin, Bhavya Kailkhura, Ryan A. Goldhahn |
Less is More: Data Pruning for Faster Adversarial Training. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Fateh Kaakai, Paul-Marie Raffi |
Towards Multi-timescale Online Monitoring of AI Models. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Chiara Picardi, Richard Hawkins, Colin Paterson, Ibrahim Habli |
Transfer Assurance for Machine Learning in Autonomous Systems. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Chenyang Yang 0002, Rachel A. Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu |
Capabilities for Better ML Engineering. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Dirk Eilers, Simon Burton 0001, Felippe Schmoeller da Roza, Karsten Roscher |
Safety Assurance with Ensemble-based Uncertainty Estimation and overlapping alternative Predictions in Reinforcement Learning. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Matthias König 0005, Annelot Bosman, Holger H. Hoos, Jan N. van Rijn |
Critically Assessing the State of the Art in CPU-based Local Robustness Verification. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Chen Chen, Haibo Hong, Mande Xie, Jun Shao, Tao Xiang |
Bab: A novel algorithm for training clean model based on poisoned data. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Tian Tan 0018, Carlos Huertas, Qi Zhao |
Efficient and Effective Uncertainty Quantification in Gradient Boosting via Cyclical Gradient MCMC. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Fabio Arnez, Ansgar Radermacher, François Terrier |
Out-of-Distribution Detection Using Deep Neural Network Latent Space Uncertainty. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Juliette Mattioli, Henri Sohier, Agnès Delaborde, Gabriel Pedroza, Kahina Amokrane-Ferka, Afef Awadid, Zakaria Chihani, Souhaiel Khalfaoui |
Towards a holistic approach for AI trustworthiness assessment based upon aids for multi-criteria aggregation. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Václav Divis, Tobias Schuster, Marek Hrúz |
Domain-centric ADAS Datasets. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Nikiforos Pittaras, Sean McGregor |
A Taxonomic System for Failure Cause Analysis of Open Source AI Incidents. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon |
On Evaluating Adversarial Robustness of Chest X-ray Classification. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Gabriel Pedroza, Xiaowei Huang 0001, Xin Cynthia Chen, Andreas Theodorou, José Hernández-Orallo, Mauricio Castillo-Effen, Richard Mallah, John A. McDermid (eds.) |
Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Washington DC, USA, February 13-14, 2023. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Maxime Fuccellaro, Laurent Simon, Akka Zemmari |
A Robust Drift Detection Algorithm with High Accuracy and Low False Positives Rate. |
SafeAI@AAAI |
2023 |
DBLP BibTeX RDF |
|
1 | Soyeon Jung, Ransalu Senanayake, Mykel J. Kochenderfer |
A Gray Box Model for Characterizing Driver Behavior. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Gabriel Pedroza, José Hernández-Orallo, Xin Cynthia Chen, Xiaowei Huang 0001, Huáscar Espinoza, Mauricio Castillo-Effen, John A. McDermid, Richard Mallah, Seán Ó hÉigeartaigh (eds.) |
Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022), Virtual, February, 2022. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Mingjun Ma, Dehui Du, Yuanhao Liu, Yanyun Wang 0003, Yiyang Li |
Efficient Adversarial Sequence Generation for RNN with Symbolic Weighted Finite Automata. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Chloe He 0002, Borja G. León, Francesco Belardinelli |
Do Androids Dream of Electric Fences? Safety-Aware Reinforcement Learning with Latent Shielding. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Poulami Sinhamahapatra, Rajat Koner, Karsten Roscher, Stephan Günnemann |
Is it all a cluster game? - Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Sheila Alemany, Niki Pissinou |
The Dilemma Between Data Transformations and Adversarial Robustness for Time Series Application Systems. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Fangqi Li, Lei Yang, Shilin Wang, Alan Wee-Chung Liew |
Leveraging Multi-task Learning for Umambiguous and Flexible Deep Neural Network Watermarking. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Hal Ashton, Matija Franklin |
The Problem of Behaviour and Preference Manipulation in AI Systems. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Amany Alshareef, Nicolas Berthier, Sven Schewe, Xiaowei Huang 0001 |
Quantifying the Importance of Latent Features in Neural Networks. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Michal Filipiuk, Vasu Singh |
Comparing Vision Transformers and Convolutional Nets for Safety Critical Systems. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Mathieu Godbout, Maxime Heuillet, Sharath Chandra Raparthy, Rupali Bhati, Audrey Durand |
A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Jin Woo Ro, Gerald Lüttgen, Diedrich Wolter |
Reinforcement Learning With Imperfect Safety Constraints. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Peter Barnett, John Burden |
Oases of Cooperation: An Empirical Evaluation of Reinforcement Learning in the Iterated Prisoner's Dilemma. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Preston Putzel, Scott Lee |
Blackbox Post-Processing for Multiclass Fairness. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Hal Ashton |
Defining and Identifying the Legal Culpability of Side Effects using Causal Graphs. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Saeed Bakhshi Germi, Esa Rahtu |
A Practical Overview of Safety Concerns and Mitigation Methods for Visual Deep Learning Algorithms. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Bertrand Braunschweig, Rodolphe Gelin, François Terrier |
The wall of safety for AI: approaches in the Confiance.ai program. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Edoardo Manino, Danilo S. Carvalho, Yi Dong 0002, Julia Rozanova, Xidan Song, Mustafa A. Mustafa, André Freitas, Gavin Brown 0001, Mikel Luján, Xiaowei Huang 0001, Lucas C. Cordeiro |
EnnCore: End-to-End Conceptual Guarding of Neural Architectures. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Juliette Mattioli, Gabriel Pedroza, Souhaiel Khalfaoui, Bertrand Leroy |
Combining Data-Driven and Knowledge-Based AI Paradigms for Engineering AI-Based Safety-Critical Systems. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Pascal Gerber, Lisa Jöckel, Michael Kläs |
A Study on Mitigating Hard Boundaries of Decision-Tree-based Uncertainty Estimates for AI Models. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Ann-Katrin Reuel, Mark Koren, Anthony Corso 0001, Mykel J. Kochenderfer |
Using Adaptive Stress Testing to Identify Paths to Ethical Dilemmas in Autonomous Systems. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Ignacio Serna, Daniel DeAlcala, Aythami Morales Moreno, Julian Fiérrez, Javier Ortega-Garcia |
IFBiD: Inference-Free Bias Detection. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Leopold Müller, Lars Böcking, Michael Färber 0001 |
Safety Aware Reinforcement Learning by Identifying Comprehensible Constraints in Expert Demonstrations. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | John Mern, Sidhart Krishnan, Anil Yildiz, Kyle Hatch, Mykel J. Kochenderfer |
Interpretable Local Tree Surrogate Policies. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Zikang Xiong, Ishika Agarwal, Suresh Jagannathan |
HiSaRL: A Hierarchical Framework for Safe Reinforcement Learning. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Deebul S. Nair, Nico Hochgeschwender, Miguel A. Olivares-Méndez |
Maximum Likelihood Uncertainty Estimation: Robustness to Outliers. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Prajit T. Rajendran, Huáscar Espinoza, Agnès Delaborde, Chokri Mraidha |
Human-in-the-loop Learning for Safe Exploration through Anomaly Prediction and Intervention. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Adrian Schwaiger, Kristian Schwienbacher, Karsten Roscher |
Beyond Test Accuracy: The Effects of Model Compression on CNNs. |
SafeAI@AAAI |
2022 |
DBLP BibTeX RDF |
|
1 | Vibhu Gautam, Youcef Gheraibia, Rob Alexander, Richard Hawkins |
Runtime Decision Making Under Uncertainty in Autonomous Vehicles. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | David Lindner, Kyle Matoba, Alexander Meulemans |
Challenges for Using Impact Regularizers to Avoid Negative Side Effects. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Gavin Leech, Nandi Schoots, Joar Skalse |
Safety Properties of Inductive Logic Programming. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Franz Wotawa |
On the Use of Available Testing Methods for Verification & Validation of AI-based Software and Systems. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Nivasini Ananthakrishnan, Shai Ben-David, Tosca Lechner |
Classification Confidence Scores with Point-wise Guarantees. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Saasha Nair, Sina Shafaei, Daniel Auge, Alois C. Knoll |
An Evaluation of "Crash Prediction Networks" (CPN) for Autonomous Driving Scenarios in CARLA Simulator. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | John Hyatt, Michael Lee |
Multi-Modal Generative Adversarial Networks Make Realistic and Diverse but Untrustworthy Predictions When Applied to Ill-posed Problems. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Javier Hernandez-Ortega, Ruben Tolosana, Julian Fiérrez, Aythami Morales |
DeepFakesON-Phys: DeepFakes Detection based on Heart Rate Estimation. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Jan-Pieter Paardekooper, Mauro Comi, Corrado Grappiolo, Ron Snijders, Willeke van Vught, Rutger Beekelaar |
A Hybrid-AI Approach for Competence Assessment of Automated Driving functions. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Rob Ashmore, Alec Banks |
The Utility of Neural Network Test Coverage Measures. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Jared Markowitz, Marie Chau, I-Jeng Wang |
Deep CPT-RL: Imparting Human-Like Risk Sensitivity to Artificial Agents. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Franziska Schwaiger, Maximilian Henne, Fabian Küppers, Felippe Schmoeller Roza, Karsten Roscher, Anselm Haselhoff |
From Black-box to White-box: Examining Confidence Calibration under different Conditions. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Ernest Wozniak, Henrik J. Putzer, Carmen Cârlan |
AI-Blueprint for Deep Neural Networks. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Takuma Amada, Kazuya Kakizaki, Seng Pei Liew, Toshinori Araki, Joseph Keshet, Jun Furukawa 0001 |
Adversarial Robustness for Face Recognition: How to Introduce Ensemble Diversity among Feature Extractors? |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | John Burden, José Hernández-Orallo, Seán Ó hÉigeartaigh |
Negative Side Effects and AI Agent Indicators: Experiments in SafeLife. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Haiwen Huang, Zhihan Li, Lulu Wang, Sishuo Chen, Xinyu Zhou, Bin Dong |
Feature Space Singularity for Out-of-Distribution Detection. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Ville Vakkuri, Marianna Jantunen, Erika Halme, Kai-Kristian Kemell, Anh Nguyen-Duc 0001, Tommi Mikkonen, Pekka Abrahamsson |
Time for AI (Ethics) Maturity Model Is Now. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Jakub Tetek, Marek Sklenka, Tomas Gavenciak |
Performance of Bounded-Rational Agents With the Ability to Self-Modify. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Francesco Cartella, Orlando Anunciação, Yuki Funabiki, Daisuke Yamaguchi, Toru Akishita, Olivier Elshocht |
Adversarial Attacks for Tabular Data: Application to Fraud Detection and Imbalanced Data. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Hal Ashton |
What criminal and civil law tells us about Safe RL techniques to generate law-abiding behaviour. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Václav Divis, Marek Hrúz |
Neural Criticality: Validation of Convolutional Neural Networks. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Christopher Harper, Praminda Caleb-Solly |
Towards an Ontological Framework for Environmental Survey Hazard Analysis of Autonomous Systems. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Adrien Gauffriau, François Malgouyres, Mélanie Ducoffe |
Overestimation Learning with Guarantees. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Huáscar Espinoza, John A. McDermid, Xiaowei Huang 0001, Mauricio Castillo-Effen, Xin Cynthia Chen, José Hernández-Orallo, Seán Ó hÉigeartaigh, Richard Mallah (eds.) |
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), Virtual, February 8, 2021. |
SafeAI@AAAI |
2021 |
DBLP BibTeX RDF |
|
1 | Huáscar Espinoza, José Hernández-Orallo, Xin Cynthia Chen, Seán S. ÓhÉigeartaigh, Xiaowei Huang 0001, Mauricio Castillo-Effen, Richard Mallah, John A. McDermid (eds.) |
Proceedings of the Workshop on Artificial Intelligence Safety, co-located with 34th AAAI Conference on Artificial Intelligence, SafeAI@AAAI 2020, New York City, NY, USA, February 7, 2020. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Sarthak Jindal, Raghav Sood, Richa Singh 0001, Mayank Vatsa, Tanmoy Chakraborty 0002 |
NewsBag: A Benchmark Multimodal Dataset for Fake News Detection. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Sina Mohseni, Mandar Pitale, Vasu Singh, Zhangyang Wang |
Practical Solutions for Machine Learning Safety in Autonomous Vehicles. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Bharat Prakash, Nicholas R. Waytowich, Ashwinkumar Ganesan, Tim Oates 0001, Tinoosh Mohsenin |
Guiding Safe Reinforcement Learning Policies Using Structured Language Constraints. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Melanie Ducoffe, Sébastien Gerchinovitz, Jayant Sen Gupta |
A High Probability Safety Guarantee for Shifted Neural Network Surrogates. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Lifeng Liu, Yingxuan Zhu, Tim Tingqiu Yuan, Jian Li |
Continuous Safe Learning Based on First Principles and Constraints for Autonomous Driving. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Kazuya Kakizaki, Kosuke Yoshida |
Adversarial Image Translation: Unrestricted Adversarial Examples in Face Recognition Systems. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Vahid Behzadan, Ibrahim M. Baggili |
Founding The Domain of AI Forensics. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Carroll L. Wainwright, Peter Eckersley |
SafeLife 1.0: Exploring Side Effects in Complex Environments. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Imane Lamrani, Ayan Banerjee 0001, Sandeep K. S. Gupta |
Toward Operational Safety Verification Via Hybrid Automata Mining Using I/O Traces of AI-Enabled CPS. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Michiel A. Bakker, Humberto Riverón Valdés, Duy Patrick Tu, Krishna P. Gummadi, Kush R. Varshney, Adrian Weller, Alex Pentland |
Fair Enough: Improving Fairness in Budget-Constrained Decision Making Using Confidence Thresholds. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Vojtech Kovarík, Ryan Carey |
(When) Is Truth-telling Favored in AI Debate? |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | John Burden, José Hernández-Orallo |
Exploring AI Safety in Degrees: Generality, Capability and Control. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Jin-Young Kim, Sung-Bae Cho |
Fair Representation for Safe Artificial Intelligence via Adversarial Learning of Unbiased Information Bottleneck. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Maximilian Henne, Adrian Schwaiger, Karsten Roscher, Gereon Weiss |
Benchmarking Uncertainty Estimation Methods for Deep Learning With Safety-Related Metrics. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Botty Dimanov, Umang Bhatt, Mateja Jamnik, Adrian Weller |
You Shouldn't Trust Me: Learning Models Which Conceal Unfairness From Multiple Explanation Methods. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Ashish Gaurav, Sachin Vernekar, Jaeyoung Lee, Vahdat Abdelzad, Krzysztof Czarnecki 0001, Sean Sedwards |
Simple Continual Learning Strategies for Safer Classifers. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Bowei Xi, Yujie Chen, Fei Fan, Zhan Tu, Xinyan Deng |
Bio-Inspired Adversarial Attack Against Deep Neural Networks. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|
1 | Ewen Denney, Ganesh Pai, Colin Smith |
Hazard Contribution Modes of Machine Learning Components. |
SafeAI@AAAI |
2020 |
DBLP BibTeX RDF |
|