Assign multiple columns using := in data.table, by group

Question

Asked: July 27, 2012
Views: 105260
Answers: 2
Status: Resolved

What is the best way to assign to multiple columns using data.table? For example:

f <- function(x) {c("hi", "hello")}
x <- data.table(id = 1:10)

I would like to do something like this (of course, this syntax is incorrect):

x[ , (col1, col2) := f(), by = "id"]

And to take it further, I may have many columns whose names are stored in a variable (say col_names), and I would like to do:

x[ , col_names := another_f(), by = "id", with = FALSE]

What is the correct way to do something like this?

Asked on July 27, 2012 by Alex

2 Answers

190 votes

Matt Dowle (20936 points)

This now works in v1.8.3 on R-Forge. Thanks for highlighting it!

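library(data.table)

x <- data.table(a = 1:3, b = 1:6)          # a is recycled: 1, 2, 3, 1, 2, 3
f <- function(x) {list("hi", "hello")}     # returns a list: one element per new column
x[ , c("col1", "col2") := f(), by = a][]   # the trailing [] prints the result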
#    a b col1  col2
# 1: 1 1   hi hello
# 2: 2 2   hi hello
# 3: 3 3   hi hello
# 4: 1 4   hi hello
# 5: 2 5   hi hello
# 6: 3 6   hi hello

x[ , c("mean", "sum") := list(mean(b), sum(b)), by = a][]
#    a b col1  col2 mean sum
# 1: 1 1   hi hello  2.5   5
# 2: 2 2   hi hello  3.5   7
# 3: 3 3   hi hello  4.5   9
# 4: 1 4   hi hello  2.5   5
# 5: 2 5   hi hello  3.5   7
# 6: 3 6   hi hello  4.5   9 
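The same pattern extends to a function that computes its results from each group's data. A minimal sketch, assuming a hypothetical helper `group_stats` (not part of data.table or of the original answer) and illustrative column names; the helper returns a list with one element per column to create:

```r
library(data.table)

x <- data.table(a = 1:3, b = 1:6)

# Hypothetical helper: one list element per new column.
group_stats <- function(v) {
  list(mean(v), sum(v), length(v))
}

# Each list element is assigned to the matching name on the left-hand side,
# once per group of `a`, and recycled over the rows of that group.
x[ , c("mean_b", "sum_b", "n_b") := group_stats(b), by = a]
```

Returning a list rather than an atomic vector keeps each result in its own column and lets the columns have different types.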

mynames <- c("Name1", "Longer%")
x[ , (mynames) := list(mean(b) * 4, sum(b) * 3), by = a][]  # parentheses: use the names stored in mynames
#    a b col1  col2 mean sum Name1 Longer%
# 1: 1 1   hi hello  2.5   5    10      15
# 2: 2 2   hi hello  3.5   7    14      21
# 3: 3 3   hi hello  4.5   9    18      27
# 4: 1 4   hi hello  2.5   5    10      15
# 5: 2 5   hi hello  3.5   7    14      21
# 6: 3 6   hi hello  4.5   9    18      27

x[ , get("mynames") := list(mean(b) * 4, sum(b) * 3), by = a][]  # same
#    a b col1  col2 mean sum Name1 Longer%
# 1: 1 1   hi hello  2.5   5    10      15
# 2: 2 2   hi hello  3.5   7    14      21
# 3: 3 3   hi hello  4.5   9    18      27
# 4: 1 4   hi hello  2.5   5    10      15
# 5: 2 5   hi hello  3.5   7    14      21
# 6: 3 6   hi hello  4.5   9    18      27

x[ , eval(mynames) := list(mean(b) * 4, sum(b) * 3), by = a][]   # same
#    a b col1  col2 mean sum Name1 Longer%
# 1: 1 1   hi hello  2.5   5    10      15
# 2: 2 2   hi hello  3.5   7    14      21
# 3: 3 3   hi hello  4.5   9    18      27
# 4: 1 4   hi hello  2.5   5    10      15
# 5: 2 5   hi hello  3.5   7    14      21
# 6: 3 6   hi hello  4.5   9    18      27
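For the second part of the question (many columns whose names live in a variable), the parenthesized form can be combined with a function that returns a list of matching length. A minimal sketch, assuming illustrative names `col_names` and `another_f` borrowed from the question and a quantile computation chosen only as an example:

```r
library(data.table)

x <- data.table(a = 1:3, b = 1:6)

probs     <- c(0.25, 0.50, 0.75)
col_names <- c("q25", "q50", "q75")   # illustrative column names

# Illustrative function: returns a list with one element per entry in col_names.
another_f <- function(v) as.list(quantile(v, probs, names = FALSE))

# (col_names) tells := to use the character vector's contents as column names.
x[ , (col_names) := another_f(b), by = a]
```

The length of the returned list has to match the number of names in `col_names`, so it helps to derive both from the same source, as `probs` does here.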

Older version using the with argument (we discourage this argument where possible; prefer the parenthesized (mynames) form shown above):

x[ , mynames := list(mean(b) * 4, sum(b) * 3), by = a, with = FALSE][] # same
#    a b col1  col2 mean sum Name1 Longer%
# 1: 1 1   hi hello  2.5   5    10      15
# 2: 2 2   hi hello  3.5   7    14      21
# 3: 3 3   hi hello  4.5   9    18      27
# 4: 1 4   hi hello  2.5   5    10      15
# 5: 2 5   hi hello  3.5   7    14      21
# 6: 3 6   hi hello  4.5   9    18      27

Answered on October 6, 2012 by Matt Dowle (20936 points)

73 votes

Gerry (540 points)

The following shorthand notation may be useful. All credit goes to Andrew Brooks, specifically this article.

dt[,`:=`(avg=mean(mpg), med=median(mpg), min=min(mpg)), by=cyl]
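The one-liner above does not define dt. A runnable sketch, assuming dt is the built-in mtcars dataset converted to a data.table (the column names mpg and cyl suggest this, but the answer does not say so):

```r
library(data.table)

# Assumption: dt is mtcars as a data.table; the original answer does not define it.
dt <- as.data.table(mtcars)

# Functional form of := : each name = value pair creates one column,
# computed separately for every value of cyl.
dt[ , `:=`(avg = mean(mpg), med = median(mpg), min = min(mpg)), by = cyl]
```

This form keeps each column name next to its expression, which can be easier to read than a separate vector of names and a list of values when the columns are known in advance.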

Answered on April 1, 2018 by Gerry (540 points)
