Dynamic normativity : necessary and sufficient conditions for value alignment

Export this record:

Please use this identifier to cite or link to this item: https://tede2.pucrs.br/tede2/handle/tede/11294

Full metadata record

DC Field	Value	Language
dc.creator	Corrêa, Nicholas Kluge	-
dc.creator.Lattes	http://lattes.cnpq.br/2495127333199411	por
dc.contributor.advisor1	Oliveira Junior, Nythamar Hilario Fernandes de	-
dc.contributor.advisor1Lattes	http://lattes.cnpq.br/3541527557611037	por
dc.contributor.advisor-co1	Schulz, Michael	-
dc.date.accessioned	2024-06-21T18:56:14Z	-
dc.date.issued	2024-05-13	-
dc.identifier.uri	https://tede2.pucrs.br/tede2/handle/tede/11294	-
dc.description.resumo	The critical inquiry pervading the realm of Philosophy, and perhaps ex-tending its influence across all Humanities disciplines, revolves around the intricacies of morality and normativity. Surprisingly, in recent years, this thematic thread has woven its way into an unexpected domain, one not conventionally associated with pondering "what ought to be": the field of artificial intelligence (AI) research. Central to morality and AI, we find "alignment", a problem related to the challenges of expressing human goals and values in a manner that artificial systems can follow without lead-ing to unwanted adversarial effects. More explicitly and with our cur-rent paradigm of AI development in mind, we can think of alignment as teaching human values to non-anthropomorphic entities trained through opaque, gradient-based learning techniques. This work addresses align-ment as a technical-philosophical problem that requires solid philosophical foundations and practical implementations that bring normative theory to AI system development. To accomplish this, we propose two sets of neces-sary and sufficient conditions that, we argue, should be considered in any alignment process. While necessary conditions serve as metaphysical and metaethical roots that pertain to the permissibility of alignment, sufficient conditions establish a blueprint for aligning AI systems under a learning-based paradigm. After laying such foundations, we present implementa-tions of this approach by using state-of-the-art techniques and methods for aligning general-purpose language systems. We call this framework Dy-namic Normativity. Its central thesis is that any alignment process under a learning paradigm that cannot fulfill its necessary and sufficient conditions will fail in producing aligned systems.	por
dc.description.abstract	A investigação crítica que permeia o campo da filosofia, e talvez estenda sua influência a todas as disciplinas de ciências humanas, gira em torno dos meandros da moralidade e da normatividade. Surpreendentemente, nos últimos anos, esse fio temático foi inserido em um domínio inesperado, que não é convencionalmente associado à reflexão sobre "o que deve ser": o campo de pesquisa da inteligência artificial (IA). No centro da moralidade e da IA, encontramos o "alinhamento", um problema relacionado aos desafios de expressar metas e valores humanos de uma forma que os sistemas artificiais possam seguir sem causar efeitos adversos indesejados. De forma mais explícita e com nosso paradigma atual de desenvolvimento de IA em mente, podemos pensar no alinhamento como o ensino de valores humanos a entidades não antropomórficas treinadas por meio de técnicas de aprendizado opacas e baseadas em gradiente. Este trabalho aborda o alinhamento como um problema técnico-filosófico que requer fundamentos filosóficos sólidos e implementações práticas que tragam a teoria normativa para o desenvolvimento do sistema de IA. Para isso, propomos dois conjuntos de condições necessárias e suficientes que, segundo nosso argumento, devem ser consideradas em qualquer processo de alinhamento. Enquanto as condições necessárias servem como raízes metafísicas e metaéticas relacionadas à permissibilidade do alinhamento, as condições suficientes estabelecem um plano para alinhar os sistemas de IA sob um paradigma baseado em aprendizado. Depois de estabelecer essas bases, apresentamos implementações dessa abordagem usando técnicas e métodos de última geração para alinhar sistemas de linguagem de uso geral. Chamamos essa estrutura de Dinâmica Normativa. Sua tese central é que qualquer processo de alinhamento sob um paradigma de aprendizagem que não possa cumprir suas condições necessárias e suficientes falhará na produção de sistemas alinhados.	por
dc.description.provenance	Submitted by PPG Filosofia ([email protected]) on 2024-06-12T18:51:56Z No. of bitstreams: 1 NICHOLAS_KLUGE_CORREA_TES.pdf: 9914428 bytes, checksum: 7784ab9f35e9d5097c1504e84737c7e9 (MD5)	eng
dc.description.provenance	Approved for entry into archive by Sarajane Pan ([email protected]) on 2024-06-21T14:45:43Z (GMT) No. of bitstreams: 1 NICHOLAS_KLUGE_CORREA_TES.pdf: 9914428 bytes, checksum: 7784ab9f35e9d5097c1504e84737c7e9 (MD5)	eng
dc.description.provenance	Made available in DSpace on 2024-06-21T18:56:14Z (GMT). No. of bitstreams: 1 NICHOLAS_KLUGE_CORREA_TES.pdf: 9914428 bytes, checksum: 7784ab9f35e9d5097c1504e84737c7e9 (MD5) Previous issue date: 2024-05-13	eng
dc.description.sponsorship	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES	por
dc.format	application/pdf	*
dc.thumbnail.url	https://tede2.pucrs.br/tede2/retrieve/191085/NICHOLAS_KLUGE_CORREA_TES.pdf.jpg	*
dc.language	eng	por
dc.publisher	Pontifícia Universidade Católica do Rio Grande do Sul	por
dc.publisher.department	Escola de Humanidades	por
dc.publisher.country	Brasil	por
dc.publisher.initials	PUCRS	por
dc.publisher.program	Programa de Pós-Graduação em Filosofia	por
dc.rights	Acesso Aberto	por
dc.subject	Artificial Intelligence	eng
dc.subject	Alignment	eng
dc.subject	Value Learning	eng
dc.subject	Inteligência Artificial	por
dc.subject	Alinhamento	por
dc.subject	Aprendizagem de Valor	por
dc.subject.cnpq	CIENCIAS HUMANAS::FILOSOFIA	por
dc.title	Dynamic normativity : necessary and sufficient conditions for value alignment	por
dc.type	Tese	por
dc.restricao.situacao	Trabalho não apresenta restrição para publicação	por
Appears in Collections:	Programa de Pós-Graduação em Filosofia

Files in This Item:

File	Description	Size	Format
NICHOLAS_KLUGE_CORREA_TES.pdf	NICHOLAS_KLUGE_CORRÊA_DIS	9.68 MB	Adobe PDF	Download/Open Preview ×

Show simple item record Recommend this item

PUCRS

Digital Library of Theses and Dissertations