https://dblp.org/rdf/schema#authoredBy | https://dblp.org/pid/173/2639, https://dblp.org/pid/192/7541, https://dblp.org/pid/372/4272, https://dblp.org/pid/372/3725, https://dblp.org/pid/62/7328, https://dblp.org/pid/221/3556, https://dblp.org/pid/372/3406, https://dblp.org/pid/372/3216, https://dblp.org/pid/188/9156, https://dblp.org/pid/228/2536, https://dblp.org/pid/348/4544, https://dblp.org/pid/75/10470, https://dblp.org/pid/282/0561, https://dblp.org/pid/b/WilliamJByrne, https://dblp.org/pid/209/9871, https://dblp.org/pid/193/1513, https://dblp.org/pid/64/6025, https://dblp.org/pid/11/1520, https://dblp.org/pid/39/6853
https://dblp.org/rdf/schema#bibtexType | http://purl.org/net/nknouf/ns/bibtex#Article
https://dblp.org/rdf/schema#documentPage | https://doi.org/10.48550/ARXIV.2403.10704
https://dblp.org/rdf/schema#doi | https://doi.org/10.48550/ARXIV.2403.10704, http://dx.doi.org/10.48550/ARXIV.2403.10704
https://dblp.org/rdf/schema#listedOnTocPage | https://dblp.org/db/journals/corr/corr2403
https://dblp.org/rdf/schema#numberOfCreators | 19
https://dblp.org/rdf/schema#primaryDocumentPage | https://doi.org/10.48550/ARXIV.2403.10704
https://dblp.org/rdf/schema#publishedIn | CoRR
https://dblp.org/rdf/schema#publishedInJournal | CoRR
https://dblp.org/rdf/schema#publishedInJournalVolume | abs/2403.10704
https://dblp.org/rdf/schema#title | PERL: Parameter Efficient Reinforcement Learning from Human Feedback.
https://dblp.org/rdf/schema#yearOfPublication | 2024
owl:sameAs | https://doi.org/10.48550/ARXIV.2403.10704, http://dx.doi.org/10.48550/ARXIV.2403.10704
rdf:type | https://dblp.org/rdf/schema#Publication, https://dblp.org/rdf/schema#Informal
rdfs:label | Hakim Sidahmed et al.: PERL: Parameter Efficient Reinforcement Learning from Human Feedback. (2024)