Property Checking with Interpretable Error Characterization for Recurrent Neural Networks
Resumen:
We propose a procedure for checking properties of recurrent neural networks used for language modeling and sequence classification. Our approach is a case of black-box checking based on learning a prob- ably approximately correct, regular approximation of the intersection of the language of the black-box (the network) with the complement of the property to be checked, without explicitly building individual represen- tations of them. When the algorithm returns an empty language, there is a proven upper bound on the probability of the network not verifying the requirement. When the returned language is nonempty, it is certain the network does not satisfy the property. In this case, an explicit and inter- pretable characterization of the error is output together with sequences of the network truly violating the property. Besides, our approach does not require resorting to an external decision procedure for verification nor fixing a specific property specification formalism.
2020 | |
Artificial intelligence Machine Learning Verification Ciencias Naturales y Exactas Ciencias de la Computación e Información |
|
Inglés | |
Agencia Nacional de Investigación e Innovación | |
REDI | |
https://hdl.handle.net/20.500.12381/458 | |
Acceso abierto | |
Reconocimiento-NoComercial 4.0 Internacional. (CC BY-NC) |