Tag: Data Analyst

การวิเคราะห์ข้อมูล จะช่วยให้เราหาข้อมูลเชิงลึกในธุรกิจเพื่อพัฒนาธุรกิจให้ดีขึ้นได้จากข้อมูลที่เรามี

10 Topics of Data analysis of the UCI Machine Learning Repository’s Online Retail dataset using R

เป็นบทความสำหรับการวิเคราะห์ข้อมูลโดยโปรแกรม R เพื่อที่จะหากลุ่มลูกค้าที่จะเก็บข้อมูลมาว่าลูกค้ากลุ่มไหน ควรออกแบบ Campaign อะไรที่จะสามารถตอบโจทย์และบอกได้ว่าลูกค้ากลุ่มไหนควรแนะนำให้เพิ่มบริการหรือลดบริการ

เนื่องจากปัจจุบันการสูญเสียลูกค้ากันเยอะขึ้น ในหลายๆ รูปแบบจึงอยากหาสาเหตุว่าลูกค้ากลุ่มไหนควรที่จะทำ Campaign อะไรให้เพื่อฟื้นฟูความสัมพันธ์กับลูกค้าเหล่านั้น โดยการสร้าง RFM Feature Engineering ขึ้นมาจากข้อมูลที่มี

RFM Feature Engineering คือมันคือการแปลงประวัติการซื้อของลูกค้า (ที่เป็นข้อมูลดิบ) ให้กลายเป็น 3 คำดังนี้ Recency, Frequency and Monetary เพื่อนำไปใช้สำหรับการวิเคราะห์การตลาด จากข้อมูลดิบของการซื้อขายครับ คือ การเปลี่ยนข้อมูล Transaction ที่เข้าใจยาก ให้กลายเป็น 3 คอลัมน์ใหม่ที่เข้าใจง่าย เพื่อใช้วิเคราะห์พฤติกรรมลูกค้าครับ

โดย 3 Features ที่เราสร้างขึ้นมานั้นย่อมาจาก:

R = Recency (ความสดใหม่): ลูกค้าซื้อครั้งล่าสุดเมื่อไหร่? (เช่น 10 วันที่แล้ว)
F = Frequency (ความถี่): ลูกค้าซื้อบ่อยแค่ไหน? (เช่น 5 ครั้ง)
M = Monetary (มูลค่าการใช้จ่าย): ลูกค้าใช้เงินไปทั้งหมดเท่าไหร่? (เช่น 8,000 บาท)

Data Analyst with R

Install Library

ก่อนที่จะเริ่มต้นวิเคราะห์ข้อมูล ก็เริ่มต้นด้วยการ Download Library เพื่อที่จะสามารถใช้ Function ต่างๆได้หลากหลายขึ้น เช่น tidyverse และ readxl ก่อน

Install readxl

install.packages(“readxl”) download มาเพื่อสามารถอ่านข้อมูลจากไฟล์ Excel ได้
library(readxl)

install.packages("readxl")
library(readxl)

Install tidyverse

install.packages(“tidyverse”) Download มาเพื่อสามารถวิเคราะห์ข้อมูลทั้งหมดได้ ไม่ว่าจะ dplyr และ ggplot
library(tidyverse)

install.packages("tidyverse")
library(tidyverse)

หลังจาก run code จะสามารถดึงอุปกรณ์ในการช่วยวิเคราะห์ข้อมูลได้ตามรูปด้านล่าง

Read data

Read excel

สามารถอ่าน File Excel ชื่อ Online Retail.xlsx แล้วเลือก Sheet ที่ต้องการวิเคราะห์ข้อมูล ซึ่งก็คือ Sheet ที่ 1 ของไฟล์นี้

## read file from excel
retail_data <- read_excel("Online Retail.xlsx", sheet = 1)
View(retail_data)

Head data

เลือก head มาเพื่อที่จะดูข้อมูล Column ของมีข้อมูลแถวบนเป็นยังไงบ้าง

## show head data
print("Example:")
print(head(retail_data))

Glimpse

glimpse จะแสดงโครงสร้างข้อมูลของ data frame จะสามารถรู้ได้ว่ามีคอลัมน์อะไรบ้าง, แต่ละคอลัมน์มีชนิดข้อมูล (Data Type) อะไร, และมีข้อมูลตัวอย่างหน้าตาเป็นอย่างไร ดังรูป

## show data structure
print("data structure:")
glimpse(retail_data)

Definition of Column

เราเริ่มจากการดูข้อมูลเบื้องต้นก่อนว่า แต่ละ Column คืออะไรบ้าง

InvoiceNo

InvoiceNo คือ (เลขที่ใบแจ้งหนี้)
เป็นรหัสที่ใช้ “จัดกลุ่ม” สินค้าที่ถูกซื้อในธุรกรรม (Transaction) เดียวกัน
จากตัวอย่าง InvoiceNo “536365” มีหลายแถว หมายความว่า ลูกค้าคนนี้สั่งสินค้าหลายอย่างในใบเสร็จใบเดียวกัน

StockCode

StockCode คือ รหัสสินค้า
รหัสเฉพาะของสินค้าแต่ละชิ้น (คล้ายกับ SKU)

Description

Description คือ รายละเอียดสินค้า
ชื่อหรือคำอธิบายของสินค้า (เช่น “WHITE HANGING HEART T-LIGHT HOLDER”)

Quantity

Quantity คือ จำนวนสินค้า
จำนวนสินค้า ชิ้นนั้น ที่ถูกสั่งซื้อในใบเสร็จนี้
ข้อควรระวัง: ในข้อมูลชุดนี้ บางครั้งค่า Quantity อาจ ติดลบ ซึ่งหมายถึงการยกเลิก (Cancellation) หรือการคืนสินค้า (Return)

InvoiceDate

InvoiceDate คือ วันที่สั่งซื้อ
วันที่และเวลาที่ธุรกรรมนั้นเกิดขึ้น (เช่น 2010-12-01 08:26:00)

UnitPrice

UnitPrice คือ ราคาต่อหน่วย
ราคาของสินค้าชิ้นนั้น 1 หน่วย (เช่น 2.55)

CustomerID

CustomerID คือ รหัสลูกค้า
รหัสประจำตัวของลูกค้าที่ทำการสั่งซื้อ (เช่น 17850)
ข้อควรระวัง: ในข้อมูลชุดนี้ บางแถวอาจไม่มี CustomerID (เป็นค่าว่าง หรือ NA) ซึ่งหมายถึงการซื้อแบบที่ไม่ได้ล็อกอิน (Guest)

Country

Country คือ ประเทศ
ประเทศที่ลูกค้าคนนั้นอาศัยอยู่

Data Cleaning

Clean missing CustomerID value

หลังจากสำรวจข้อมูลแล้วเห็นว่าข้อมูล CustomerID บาง Column ข้อมูลไม่ครบ จึงทำลบแถวที่ข้อมูลไม่ครบของ CustomerID จาก Rows 541,909 เหลือ 406,829 = 133,820 Rows
ใช้ Code นี้เพื่อที่จะสามารถข้อมูลที่ยังไม่สมบูรณ์ เช่น ที่ Column Customer ID ออกโดยใช้ filter(!is.na(CustomerID)) คือ ใช้ column CustomerID ‘ไม่’ ( ! ) ‘เป็นค่าว่าง’ ( is.na )”

## Clean missing data
retail_data_cleaned <- retail_data %>%
  filter(!is.na(CustomerID))
glimpse(retail_data_cleaned)

Cancel transaction that have C before Invoice NO.

หลังจากสำรวจข้อมูลใน Column Invoice NO. แล้วพบว่า Column ที่ Invoice NO. ถูก Cancel จะมีตัว C อยู่ข้างหน้า Invoice เหล่านั้น
เราจึงต้องกรอง Invoice ที่ขึ้นต้นด้วย C และ c ออกไปเพื่อเหลือแค่ลูกค้าที่สั่ง Order กับเราจริงๆ โดยไม่ยกเลิก Order

## Cancel transaction that have C before Invoice NO.
retail_data_cleaned <- retail_data_cleaned %>%
  filter(!startsWith(as.character(InvoiceNo), "C") & !startsWith(as.character(InvoiceNo), "c"))
glimpse(retail_data_cleaned)

**Cancel transaction that have C before Invoice NO.**

Manage Quantity and UnitPrice

กรองค่าที่ Quantity และ UnitPrice ที่น้อยกว่า 0 ออกเพื่อให้ข้อมูลถูกต้อง

## manage Quantity and UnitPrice
retail_data_cleaned <- retail_data_cleaned %>%
  filter(Quantity > 0 & UnitPrice > 0)
glimpse(retail_data_cleaned)

Create Column TotalPrice

เพิ่ม Column TotalPrice เพื่อคำนวณราคาของ Quantity * Unitprice จะได้รู้ปริมาณ * ราคาของสินค้าทั้งแถว
แล้วจะมี Column ชื่อ Total Price ตามรูปด้านล่าง

## Create Column Totalprice
retail_data_cleaned <- retail_data_cleaned %>%
  mutate(TotalPrice = Quantity * UnitPrice)
retail_data_cleaned
glimpse(retail_data_cleaned)

Change InvoiceDate into Date/time

เปลี่ยนวันที่ในข้อมูลให้กลายเป็นวันที่สามารถบอกเวลาได้ ให้เป็นรูปแบบเดียวกัน

## Change InvoiceDate into Date/time
retail_data_cleaned <- retail_data_cleaned %>%
  mutate(InvoiceDate = ymd_hms(InvoiceDate))
glimpse(retail_data_cleaned)

Summary Data

สรุปข้อมูลออกมาได้ดังนี้
Summary ได้เฉพาะ Column ที่เป็นปริมาณ

Create RFM Feature Engineering

Snapshot Date

หาวันที่ max ที่สุดของ data นี้ด้วยตัวแปร Snapshot

## snapshot Date
## use next date for last day from data
snapshot_date <- max(retail_data_cleaned$InvoiceDate) + days(1)
snapshot_date

calculate RFM

สร้างแถว Recency, Frequency and Monetary

เพื่อจะได้รับตัวแปรช่วยให้รู้ได้ว่าลูกค้ากลุ่มไหนซื้อสินค้าเรา วันล่าสุดเท่าไร ความถี่เท่าไร และค่าใช้จ่ายเท่าไร

Monetary (M): ยอดใช้จ่ายทั้งหมด
Recency (R): จำนวนวันที่ผ่านไปนับจากการซื้อครั้งล่าสุด
Frequency (F): จำนวนธุรกรรมทั้งหมด

## create new rfm_data with Recency, Frequency and Monetary
rfm_data <- retail_data_cleaned %>%
  group_by(CustomerID) %>%
  summarise(
    Recency = as.numeric(difftime(snapshot_date, max(InvoiceDate), units = "days")),
    Frequency = n_distinct(InvoiceNo),
    Monetary = sum(TotalPrice)
  )
print("Example RFM:")
print(head(rfm_data))
View(rfm_data)

K-Means (Customer Segmentation)

K-Means

K-Means คือการแบ่งฐานลูกค้าทั้งหมดของคุณออกเป็นกลุ่มย่อยๆ (Segments) โดยที่คนในกลุ่มเดียวกันจะมีพฤติกรรมหรือคุณลักษณะที่คล้ายกัน แต่จะแตกต่างจากคนในกลุ่มอื่นอย่างชัดเจน

ตัวอย่างผลลัพธ์ที่คาดว่าจะได้รับมีดังนี้

เมื่อใช้อัลกอริทึม K-Means (สมมติว่าเราตั้งค่า $K=4$) เราอาจจะได้กลุ่มลูกค้า 4 กลุ่ม เช่น:

กลุ่มลูกค้าชั้นดี (High-Value): ซื้อบ่อย (F สูง), ยอดซื้อสูง (M สูง), และเพิ่งซื้อไปไม่นาน (R ต่ำ)
กลุ่มลูกค้าที่กำลังจะหาย (At-Risk): เคยซื้อเยอะและบ่อย (F, M สูง) แต่ไม่กลับมาซื้อนานแล้ว (R สูง)
กลุ่มลูกใหม่ (New Customers): เพิ่งซื้อครั้งแรก (F, M ต่ำ) และซื้อล่าสุด (R ต่ำ)
กลุ่มลูกค้าทั่วไป (Standard): ซื้อประปราย ยอดซื้อปานกลาง

Prepare Data for K-Means (choose specific column R, F, M)

เลือกเฉพาะ Column R, F และ M จาก ตัวแปร rfm_data มาอยู่ในตัวแปร rfm_for_clustering เพื่อที่จะสามารถศึกษาข้อมูลต่อได้

## Prepare data for K-Means (Choose specially R, F, M )
rfm_for_clustering <- rfm_data %>%
  select(Recency, Frequency, Monetary)
rfm_for_clustering
View(rfm_for_clustering)

Manage with Outlier

เนื่องจาก Frequency and Monetary มีการเบ้ขวาของข้อมูล จึงใส่ค่า log เพื่อลดการคลาดเคลื่อนของข้อมูล (Outlier)
การเบ้ขวาของข้อมูล คือ ข้อมูลส่วนใหญ่กระจุกตัวอยู่ที่ฝั่งค่าน้อยกว่า

## manage with Outliers 
## column Frequency and Monetary have right skewed
## Log Transformation to reduce Outlier
rfm_log <- rfm_for_clustering %>%
  mutate(
    Recency_log = log(Recency + 1), # +1 เพื่อหลีกเลี่ยง log(0)
    Frequency_log = log(Frequency + 1),
    Monetary_log = log(Monetary + 1)
  ) %>%
  select(Recency_log, Frequency_log, Monetary_log)
glimpse(rfm_log)

Standardize

Scale( ) ใน R เป็นเครื่องมือที่สำคัญมากสำหรับการ “Standardization” หรือ “การปรับสเกลข้อมูล” ครับ

Centering (การปรับศูนย์): มันจะนำค่าในคอลัมน์นั้นไป ลบด้วยค่าเฉลี่ย (Mean) ของคอลัมน์ ผลลัพธ์คือ คอลัมน์ใหม่นี้จะมี ค่าเฉลี่ย = 0
Scaling (การปรับสเกล): จากนั้น มันจะนำค่าที่ถูก Centered แล้ว ไป หารด้วยส่วนเบี่ยงเบนมาตรฐาน (Standard Deviation – SD) ของคอลัมน์นั้น ผลลัพธ์คือ คอลัมน์ใหม่นี้จะมี Standard Deviation = 1

## Standardize
## make average to be 0 and S.e. to be 1
rfm_scaled <- scale(rfm_log)
print("Adapt with propotion:")
print(head(rfm_scaled))

View(rfm_scaled)

Elbow method

Elbow Method คือเทคนิคที่นิยมใช้เพื่อช่วยตัดสินใจว่า “จำนวนกลุ่ม (K) ที่เหมาะสมที่สุด” ควรจะเป็นเท่าไหร่ สำหรับการทำ Clustering, โดยเฉพาะกับ K-Means

## Elbow method to find K that fit to data
wss <- (nrow(rfm_scaled)-1) * sum(apply(rfm_scaled, 2, var))
for (i in 2:10) { # ทดสอบ k ตั้งแต่ 2 ถึง 10
  wss[i] <- sum(kmeans(rfm_scaled, centers = i)$withinss)
}
wss

WSS

wss คือ มันคือการคำนวณว่าข้อมูลทั้งหมดในกลุ่มนั้นๆ อยู่ “กระจัดกระจาย” หรือ “เกาะกันแน่น” แค่ไหน โดยวัดจากจุดศูนย์กลาง (Centroid) ของกลุ่ม

ค่า WSS ต่ำ = ดีมาก

หมายความว่า จุดข้อมูลต่างๆ อยู่ “ใกล้” กับจุดศูนย์กลางของกลุ่มมันมาก
แปลว่ากลุ่มนั้น “เกาะกันแน่น” (Dense) และมีความแปรปรวนภายในกลุ่มต่ำ

ค่า WSS สูง = ไม่ดี

หมายความว่า จุดข้อมูลต่างๆ อยู่ “ไกล” จากจุดศูนย์กลางกลุ่ม
แปลว่ากลุ่มนั้น “กระจัดกระจาย” (Sparse) และมีความแปรปรวนภายในกลุ่มสูง

## Calculate Within-Cluster Sum of Squares (WSS)
wss <- (nrow(rfm_scaled)-1) * sum(apply(rfm_scaled, 2, var))
for (i in 2:10) { # ทดสอบ k ตั้งแต่ 2 ถึง 10
  wss[i] <- sum(kmeans(rfm_scaled, centers = i)$withinss)
}
wss

วิธีดู “Elbow method” คือการดูว่า WSS “ลดลง” ไปเท่าไหร่ในแต่ละก้าว และมองหาจุดที่ “อัตราการลดลง” มันเริ่มน้อยลง (กราฟเริ่มแบน)

K=1 -> 2: ลดลง 13011.0 – 6472.9 = 6538.1 (ลดลงเยอะมาก)
K=2 -> 3: ลดลง 6472.9 – 4855.5 = 1617.4 (ยังลดลงเยอะ)
K=3 -> 4: ลดลง 4855.5 – 3923.4 = 932.1 (เริ่มลดน้อยลง อย่างชัดเจน)
K=4 -> 5: ลดลง 3923.4 – 3267.4 = 656.0
K=5 -> 6: ลดลง 3267.4 – 2840.2 = 427.2
K=6 -> 7: ลดลง 2840.2 – 2533.5 = 306.7 (หลังจากนี้คือลดลงน้อยมาก)
K=7 -> 8: ลดลง 2533.5 – 2330.2 = 203.3
K=8 -> 9: ลดลง 2330.2 – 2162.1 = 168.1
K=9 -> 10: ลดลง 2162.1 – 1997.5 = 164.6

จึงใช้ k = 4

K=3 -> 4: ลดลง 4855.5 – 3923.4 = 932.1 (เริ่มลดน้อยลง อย่างชัดเจน)

Create dataframe for plot graph and plot graph with ggplot

สร้างกราฟเพื่อดูว่าข้อมูลไหนห่างกันน้อยที่เมื่อเทียบกับด้านคือ K = 4

## create dataframe for plot graph
elbow_data <- data.frame(k = 1:10, wss = wss)

## plot graph with ggplot
print(
  ggplot(elbow_data, aes(x = 4, y = wss)) +
    geom_line() +
    geom_point() +
    scale_x_continuous(breaks = 1:10) +
    labs(title = "Elbow Method for Optimal 4",
         x = "Amount (4)",
         y = "Within-Cluster Sum of Squares (WSS)")
)

K-Means clustering

## K-Means clustering
set.seed(42) # make result same
k_optimal <- 4 
kmeans_result <- kmeans(rfm_scaled, centers = k_optimal, nstart = 25)

set.seed(42) เพื่อให้ข้อมูลคงค่าเดิมเสมอทุกครั้งที่ Run model
centers = k_optimal: บอก K-Means ว่า “ให้แบ่งกลุ่มข้อมูลนี้ออกเป็น 4 กลุ่มนะ” (โดยอ้างอิงค่าจากตัวแปร k_optimal ที่เราตั้งไว้)
nstart = 25: บอก K-Means ว่า “ให้ลองสุ่มจุดเริ่มต้น 25 ครั้ง แล้วเลือกเอาครั้งที่ได้ผลลัพธ์ดีที่สุด (คือได้ค่า WSS ต่ำที่สุด)” มาเป็นคำตอบสุดท้าย (ช่วยให้ได้ผลลัพธ์ที่ดีและเสถียรขึ้น)

Segment Profiling

Calculate average of R, F, M with Cluster

คำนวณค่าเฉลี่ยตามกลุ่ม Cluster เรียงตามค่าใช้จ่ายจากน้อยไปมาก

## Calculate average of R, F, M with Cluster
segment_profile <- rfm_data %>%
  group_by(Cluster) %>%
  summarise(
    Avg_Recency = mean(Recency),
    Avg_Frequency = mean(Frequency),
    Avg_Monetary = mean(Monetary),
    Count = n() # Number of Customers
  ) %>%
  arrange(Avg_Monetary) # arrange with expense

print("Profie seperate of group R, F, M:")
print(segment_profile)

Storytelling and Visualization

Bar charts to compare average R, F, M of each segment

## Bar charts to compare average R, F, M of each segment

segment_profile_long <- segment_profile %>%
  select(Cluster, Avg_Recency, Avg_Frequency, Avg_Monetary) %>%
  gather(key = "Metric", value = "Value", -Cluster)

print(
  ggplot(segment_profile_long, aes(x = Cluster, y = Value, fill = Metric)) +
    geom_bar(stat = "identity", position = "dodge") +
    facet_wrap(~ Metric, scales = "free_y") +
    labs(title = "Segment Profiles (Average RFM Values)",
         x = "Cluster",
         y = "Average Value") +
    theme_minimal()
)

Analyst from four cluster

Cluster 1

Cluster 1: ลูกค้าทั่วไป (กำลังจะห่าง)

Frequency (ความถี่): ปานกลาง (Avg. ~4)
Monetary (ยอดใช้จ่าย): ปานกลาง (Avg. ~1900)
Recency (ซื้อล่าสุด): ค่อนข้างนาน (Avg. ~70 วัน)
สรุป: กลุ่มนี้เคยซื้อค่อนข้างดี แต่เริ่มหายไปนานแล้ว (70 วัน) อาจต้องการการกระตุ้นเตือนให้กลับมา

Cluster 2

Cluster 2: 🏆 ลูกค้าชั้นดี (Best Customers / VIP)

Frequency (ความถี่): สูงที่สุด (Avg. ~14)
Monetary (ยอดใช้จ่าย): สูงที่สุด (Avg. ~8000)
Recency (ซื้อล่าสุด): ต่ำที่สุด (Avg. ~10 วัน)
สรุป: นี่คือกลุ่มที่ดีที่สุดของคุณ ซื้อบ่อย, จ่ายหนัก, และเพิ่งซื้อไปไม่นาน กลุ่มนี้คือกลุ่มที่ต้องรักษาไว้ให้ดีที่สุด (Loyalty Program, สิทธิพิเศษ)

Cluster 3

Cluster 3: 😥 ลูกค้าที่หายไปแล้ว (Lost Customers)

Frequency (ความถี่): ต่ำ (Avg. ~1.5)
Monetary (ยอดใช้จ่าย): ต่ำที่สุด (Avg. ~300)
Recency (ซื้อล่าสุด): สูงที่สุด (Avg. ~180 วัน)
สรุป: กลุ่มนี้ซื้อน้อย จ่ายน้อย และที่สำคัญคือ ไม่กลับมาซื้อนานมากแล้ว (เกือบ 180 วัน) การดึงลูกค้ากลุ่มนี้กลับมาอาจต้องใช้โปรโมชั่นที่แรงมาก (Win-back campaign)

Cluster 4

Cluster 4: ✨ ลูกค้าใหม่ (New Customers)

Frequency (ความถี่): ต่ำ (Avg. ~2)
Monetary (ยอดใช้จ่าย): ต่ำ (Avg. ~600)
Recency (ซื้อล่าสุด): ต่ำ (Avg. ~20 วัน)
สรุป: กลุ่มนี้เพิ่งเข้ามาซื้อได้ไม่นาน (Recency ต่ำ) แต่ยังซื้อไม่บ่อยและยังจ่ายไม่เยอะ (F, M ต่ำ) เป้าหมายคือต้องกระตุ้น (Nurture) ให้พวกเขากลายเป็น Cluster 2 ในอนาคต

Recommended Campaign

Cluster	Segment	จำนวนลูกค้า	กลยุทธ์ที่แนะนำ
1	At-Risk	1158	ดึงกลับ ส่งแคปเปญ We miss you
2	Champions	723	รักษา มอบรางวัล loyalty ให้สิทธิ์ VIP
3	Lost	1579	ไม่ต้องโฟกัส
4	Potential	878	พัฒนา กระตุ้นการซื้อถัดไป

Export Data for search cluster in excel

install.packages("writexl")
library(writexl)
write_xlsx(rfm_data, "rfm_data_export.xlsx")

install package write excel เพื่อที่จะสามารถนำไปดูต่อใน Excel ได้ว่า Customer ID ควรสร้าง Campaign อะไร
Loyalty for VIP, We miss you สำหรับลูกค้าที่จะหายไป, Potential ที่พัฒนาการกระตุ้นซื้อครั้งถัดไป, Lost ไม่ต้องโฟกัสเยอะ แล้วให้ไปโฟกัสลูกค้ากลุ่มอื่นๆ

rfm_data_export Download

Github :

ดูตัวอย่าง code ทั้งหมดได้ที่ https://github.com/Chayanonboo/code-for-articles/blob/main/code_R/Online_Retail_Data_Set_from_UCI_ML_repo30_10_2025.ipynb

Reference :

https://www.kaggle.com/datasets/jihyeseo/online-retail-data-set-from-uci-ml-repo

October 31, 2025

Create Resume With Economics and Psychology

เป็นบทความที่เขียนเพื่อแนะนำวิธีการเขียน resume โดยใช้หลัก Economics and Psychology ในการช่วยเขียนเพื่อให้มีโอกาสที่จะถูกเรียกในสัมภาษณ์ได้มากขึ้น พร้อมกับ การแนะนำลำดับการเขียน Contact, Career Objective, Experience, Education, Skills + Certification, Portfolio / Project เพื่อให้ HR อ่านเข้าใจง่ายแล้วเรียกสัมภาษณ์ง่ายขึ้นครับ

Create Resume

The Market View

ในมุมการตลาดยุคปัจจุบัน ทุกคนรู้สึกว่างานยากขึ้น

จริงๆ แล้วปัจจุบันงานไม่ได้หางานยากเลย งานหาง่ายเพราะมีช่องทางเปิดให้สมัครงาน ตามอินเตอร์เน็ตได้เยอะและส่งใบสมัครกันได้รวดเร็วยิ่งขึ้น
ถ้าทุกคนอยากทำจะมีงานให้ทำเลย ถ้าเข้าใจ The market

The Market คือ ตลาดที่เกิดการแลกเปลี่ยนหรือซื้อ-ขาย กันทั้งใน Real or Online World

โดยผู้สมัครงาน สามารถหางานได้ทั้ง Platform Online หรือ Offline โดยที่เรามี Skill ที่เหมาะสมกับงานเหล่านั้น เช่น E-Commerce ตลาดนัด และ Shopping Mall

Labour market

ตลาดแรงงานเป็นตลาดพิเศษ โดยที่สินค้าที่เราขายตอนนี้ คือ ตัวเราที่ขายเวลาของเราในแต่ละวัน เพื่อที่จะได้ไปทำงานในหน่วยงานหนึ่งๆ หรือ พนักงานออฟฟิศ

Type	Definition	Example
Supply	อุปสงค์แรงงาน	คนสมัครงานตามบริษัท
Demand	อุปทานแรงงาน	บริษัทที่พร้อมจ้างงานเรา

Equilibrium จุดตัดที่ Supply กับ Demand ตัดกัน

A change in supply

สิ่งที่กำลังเกิดขึ้นในตอนนี้คือ ผู้สมัครงานมีจำนวนมากขึ้น

แปลว่า ปัจจุบันมีแรงงานทำงานมากยิ่งขึ้น
โดย แรงงานปัจจุบัน ไม่เหมือนกับเมื่อ 3 ปีที่แล้ว เนื่อง AI

Labour	Definition
Past 3 Year	People
Now	Bot + People

โดย Supply ในหลายๆ ตลาดเพิ่มขึ้น เพราะ ในปัจจุบันมี Concept AI Agent ที่สามารถช่วยทำงานให้เราได้มากยิ่งขึ้น

Short term

W1 จะเกิด Supply > Demand

ตลาดแรงงานจะก็ต้องปรับตัวด้วยการดันราคาตลาดลงมา เพราะคนอยากทำงานมากกว่าความต้องการตลาดจะเกิด Oversupply สิ่งที่ตลาดจะทำคือปรับเงินเดือนต่ำลง เพื่อให้ Demand สูงขึ้น แล้วมา Match ที่จุดดุลยภาพจุด W2 ใหม่

พอมี Robot เข้ามาจะทำให้หลายองค์กรไม่อยากจ่ายเงินเดือนสูงแล้ว เพราะสามารถจ้าง Robot ได้

Relevance AI

เช่น Website Relevance(AI) : https://relevanceai.com/

Define Role and Objective

สามารถใช้ Agentic Chatbot สำหรับวิเคราะห์เกี่ยวกับ Data Analyst ได้
สามารถสร้าง Chatbot แล้วกำหนดคุณสมบัติ Chatbot ให้เราได้ดังนี้เลย

Analyze Our Profile

รู้ข้อมูลเกี่ยวกับเราหมดเลย เช่น Analyzed your LinkedIn Profile (nice W3Schools certification)
หลังสมัคร Program Relevance AI Bot ก็ไปดึงข้อมูลจาก Profile เราแล้วแจ้งข้อมูลมาดังนี้

Analyze Our Profile

Low Elastic vs High Elastic

ความยืดหยุ่นน้อย คือ Demand กราฟชันมากจะยืดหยุ่นน้อย
ความยืดหยุ่นมาก คือ Demand กราฟชันน้อยจะยืนหยุ่นมาก

Low Elastic demand vs High Elastic Demand

Market	Job
Low Elastic Demand	Call center, Porter
High Elastic Demand	Data

เราควรเป็นเหมือนตลาดด้านขวา เพราะ W2 ลดลงน้อยกว่า เพราะหากเราทำงานได้หลาย Skill คนจะชอบจ้างมากกว่า คนที่ทำงานได้แค่ Skill เดียว
บางที นายจ้างไปใช้ AI แทน จึงต้องลดเงินเดือนพนักงานลงมา

เงินเดือนเราจะถูกกระทบจาก AI หรือเปล่าขึ้นอยู่กับว่าเราอยู่ในตลาดไหน

เราควรอยู่ในตลาดที่ 4-5 ปีจะมีคนมาแทนเรายาก และ เทคโนโลยีไม่สามารถแทนที่เราได้ ใน ตลาดยืนหยุ่นสูง

How to find work easy

งานหาง่าย ถ้าเราายอมลดเงินเดือนตัวเองลงมา เช่น ถ้างานในตลาด 5 หมื่น เรายอมที่จะรับ 3 หมื่น จะทำให้เราได้งานง่ายขึ้น

งานจะหายากเลย ถ้าเกิดเราอยากได้งานที่ราคาเกินตลาด เพราะการแข่งขันสูง

ต้องหาวิธีทำยังไงดี ให้บริษัทอยากเรียกเราไป สัมภาษณ์มากขึ้น และได้เงินเยอะขึ้นด้วย

ในฐานะ แรงงานคนนึง ควรจะเป็นคนที่ยืดหยุ่นสูง เพราะจะสามารถ Switch งานไปทำอีกสายหนึ่งได้ง่ายขึ้น

Asymmetric Information

Asymmetric Information เป็นแนวคิดทางเศรษฐศาสตร์ที่เกิดขึ้นเมื่อ ฝ่ายใดฝ่ายหนึ่งในการทำธุรกรรมหรือข้อตกลง มีข้อมูลที่มากกว่าหรือดีกว่าอีกฝ่ายหนึ่งอย่างมีนัยสำคัญ ทำให้เกิดความไม่เท่าเทียมกันในด้านข้อมูล เช่น โดยที่มีคนที่รู้ข้อมูลมากกว่ากับคนที่รู้ข้อมูลน้อยกว่า

– George Akerlof เป็นคนคิดทฤษฎีนี้ขึ้นมา

คนที่มี Insider ข้อมูลก็มีโอกาสเข้าใจได้ดีกว่า คนที่ไม่มีความรู้เกี่ยวกับตลาดหุ้นเลย โดยจะมีการพูดถึงว่า คนที่รู้น้อยกว่า จะมีเทคนิคยังไงได้บ้างที่จะทำให้คนเหล่านั้นรู้มากขึ้นได้ กับ คนที่รู้มากกว่า จะมีเทคนิคยังไงได้บ้างที่จะทำให้คนเหล่านั้นรู้น้อยขึ้นได้รับข้อมูลจากคนที่รู้ได้มากขึ้น

Signaling vs Screening

วิธีส่งข้อมูลมี 2 แบบ

ถ้ารู้มากกว่าให้ใช้การ Signaling ข้อมูล

ผู้มีข้อมูลมากกว่า พยายามกระทำการบางอย่างเพื่อ ส่งสัญญาณ หรือแสดงให้ผู้มีข้อมูลน้อยกว่า

2. ถ้ารู้น้อยกว่าให้ใช้การ Screening ข้อมูล

ผู้มีข้อมูลน้อยกว่า สร้างกลไกหรือเงื่อนไขบางอย่างขึ้นมาเพื่อ คัดกรอง หรือทำให้ผู้มีข้อมูลมากกว่า เปิดเผยข้อมูลส่วนตัวของตนเองออกมา

**Candidate signaling, Company Screening**

Screening เช่น หน้าที่บริษัทคือการ Screening Resume จาก 100 คนเหลือ 10 คน
Signaling เช่น ผู้สัมภาษณ์ส่งข้อมูลเพื่อให้บริษัทรู้เกี่ยวกับประวัติผู้สัมภาษณ์งานมากที่สุด เพื่อที่จะมีโอกาสได้รับการจ้างงานมากขึ้น

Resume คือเครื่องมือที่ใช้ในการส่งสัญญาณให้ผู้สัมภาษณ์รู้ว่าเรามีความสามารถ

Resume ควรเลือก Certification ที่เหมาะกับแต่ละบริษัทที่เราสมัครไปด้วย

Case 1 : Lemon Market

พูดถึงตลาดรถยนต์ มือสอง Lemon ผู้ขายมีความรู้หรือข้อมูลเกี่ยวกับคุณภาพของสินค้ามากกว่าผู้ซื้อ

เจ้าของเต้นท์ (ผู้ขาย) รถยนต์จะรู้เรื่องข้อมูลมากกว่าลูกค้า (ผู้ซื้อ)

ปัญหา ผู้ซื้อจะโดนโก่งราคา เช่น ผู้ซื้อมาที่ราคา 300,000 บาท ทั้งที่ราคาจริงคือ 150,000 บาท
ผู้ขายอาจจะมีเทคนิคในการดันราคารถขึ้นมาเพื่อให้ขายได้กำไรเยอะสุด
คนซื้ออยากโดยที่จ่ายเงินน้อยที่สุด
Fight between Supply and Demand

“โดนคนที่รู้มากกว่าจะมีความได้เปรียบกว่า” George A. Akerlof

Case 2 : Interview

นึกถึงเวลาที่ไปสัมภาษณ์งาน
ระหว่าง เรา กับ บริษัท ใครจะรู้ได้มากกว่ากันว่า ใครสามารถทำงานได้

ต้องมี Job Description ถึงจะรู้ว่า เราเหมาะหรือไม่เหมาะที่จะทำงานเหล่านั้น
Candidate ควรที่จะรู้ตัวมากกว่าว่าเหมาะกับงานนั้นๆ ถึงไปสมัคร

เรารู้ว่า เราใช้ SQL ระดับไหน ทำ Google sheet ได้ระดับไหน ถึงสมัครบริษัทเหล่านั้นไป
แต่ถ้าเราอยากได้งานนี้ แต่บริษัทรู้น้อยกว่าเรา

Solution : ทำยังไงก็ได้ ให้บริษัทรู้เท่าเรา และรู้ว่าเราเก่ง บริษัทถึงรับเราเข้าทำงาน

The Psychology Resume

4 Trick for good Resume

การเรียงลำดับหัวข้อใน Resume ก็เป็นเรื่องสำคัญเช่นกัน ควรเรียงหัวข้อแบบไหน ถึงจะดี
ไม่ควรที่จะมี 1 Resume สมัคร 100 งาน ควรที่จะ Customize Resume ให้เหมาะกับงาน
Ikea Effect คนเรามักจะให้คุณค่ากับสิ่งของที่ตนเองได้ลงแรงสร้าง ประกอบ หรือมีส่วนร่วมในการทำขึ้นมา มากกว่าสิ่งของที่สำเร็จรูปแล้ว โดยคนที่พิธีพิถันในการส่ง Resume ไปทีละจะได้รับความนิยมมากกว่า
บริษัทแต่ละบริษัท มีวิธี Screen ไม่เหมือนกัน จึงทำให้ออกแบบให้เหมาะสมกับบริษัทนั้นๆ

How to choose word

Power Verb คำกริยาที่แสดงถึงชัดเจน และสื่อถึงผลลัพธ์ ช่วยให้ประโยคมีความกระชับ น่าสนใจ และแสดงถึงความสามารถหรือสิ่งที่ผู้กระทำได้ทำให้เกิดขึ้นจริงๆ มักใช้เพื่อทำให้ Resume, Cover Letter
Primacy Effect แนวโน้มที่คนเรามักจะจดจำข้อมูลหรือสิ่งที่ได้รับรู้ ในตอนแรกๆ หรือส่วนต้น ได้ดีกว่าส่วนกลาง

Be careful writing your resume

AI เวลาเขียนอะไรเรามักเขียนแบบกลาง ขาดความใส่อารมณ์ให้เรา
ทีมงานสัมภาาณ์เวลาหาคนก็มักจะหวังว่า คนที่ไปสัมภาษณ์เหล่านั้น สามารถหาคนเพื่อลดความตึงเครียดในงานนั้นๆได้
ควรเขียนเป็นภาษาอังกฤษดีกว่า

DA Resume Evaluator

ลองให้ AI ประเมินผลตนเองเพื่อให้วัดประสิทธิภาพของตัวเราเอง
สามารถให้ AI ช่วย Guide ได้เลยว่า Resume ที่ดีหน้าตาเป็นไงยังไง

Purpose and Goals:

Evaluate candidate resumes and profiles for a junior data analyst role based ont he require skills.
Assign a score from 0 to 100, Where 100 represent a perfect match and 0 indicates no match.
Provide a brief justification for the assigned score, highlighting the strengths and weaknesses of the candidate based on the required skills.
Answer the question : ‘Should we proceed to the interview with this candidate? with a ‘yes’ or ‘no’ based on the evaluation

Behaviors and Rules:

a) Carefully review the candidate’s resume and/or professional profile (e.g., Linkedin).

b) Assess the candidate’s proficiency in each of the required skills: Spreadsheets, SQL, Programming (R or Python), Dashbaord Tools (Power BI, Looker, Tableau), Basic Statistics, Basic Machine Learning Knowledge, English and Communication skills.

c) Look for specific examples and quantifiable achievements that demonstarte these skills

Scoring

a) Assign a score out of 100 based on the overall alignment of the candidate’s skills and experience with the required skills.

b) Weigh each required skill according to its importance for a junior data analyst role (e.g. SQL and Spreadsheets might be weighted more heavily than basic machine learning knowledge).

c) Consider the level of proficiency demonstrated for each skill.

Justification:

a) Provide a concise explanation of the assigned score.

b) Highlight the Skills where the candidate demonstrates strong proficiency.

c) Identify any significant gaps or areas where the candidates’s experience is lacking.

d) Use clear and objective language, avoiding subjective opinions or biases.

Overall Tone:

Maintain a professional and objective tone.
Provide constructive feedback
Be concise and to the point in your evaluation

Section Resume

Section นี้ควร 6 Part หลักๆ ดังนี้

โดยมีไฟล์ตัวอย่าง Resume ให้ลองอ่านวิธีการเขียน Resume หรือสร้าง Link Website เพื่อให้ผู้สัมภาาณ์งานสามารถกด link เหล่านั้นได้ ตามแต่ part ได้เลยครับ

Chayanon_Boonthanom_Resume Download

Contact

LinkedIN : Platform ที่ช่วยในการเชื่อมต่อหาเพื่อนแล้วสามารถสมัครงานใน platform นั้นได้เลย
Personal Website : Platform ที่ไว้แสดงผลงานส่วนตัว เช่น Project ต่างๆ
Resume : เอกสารที่ช่วยสรุปให้ผู้ถูกสัมภาษณ์รู้จักผู้สัมภาษณ์มากยิ่งขึ้น

ถ้ายังไม่มี Contact ทั้ง 3 อย่าง ยังไม่ควรจะสมัครงาน

Career Objective

สามารถให้ AI ช่วยเขียนปรับปรุง Grammar ได้แต่ควรเริ่มเขียนด้วยตัวเอง
เราเป็นใคร เราทำงานอะไรมา เรามีความฝันว่าอะไร อยากจะประสบความสำเร็จอะไรมา พยายามเป็นตัวเราเอง
ใน Resume ควรจะใช้คำศัพท์ที่มันอ่านเข้าใจง่าย

Experience

ถ้าทำงานมา 5 ที่ แล้ว 2 ที่ไม่เกี่ยวกับเรื่อง data ก็ไม่จำเป็นต้องใส่
ควรใส่สิ่งที่ทำให้เราภูมิใจกับการที่ได้ทำงานที่บริษัทเหล่านั้น ว่าช่วยสร้างประโยชน์อะไรให้องค์กรได้บ้าง
ควรใช้ Action Verbs ที่นิยมใช้ใน Resume เพื่อเพิ่มโอกาสในการได้สัมภาษณ์งานดังนี้

Word	Definition
Managed	บริหารจัดการ
Developed	พัฒนา
Created	สร้างสรรค์
Analyzed	วิเคราะห์
Led	นำ
Achieved	บรรลุผล

4. เราทำสิ่งนี้แล้วบริษัทได้อะไร Qualifiable โดนมีตัวเลขที่จับต้องได้

Template : Contributed to a [quantifiable achievement]% increase in [specific metric] by implementing [specific action based on data analysis].

Education

Coursework พยายามจะใส่ว่าที่เคยเรียนมาเกี่ยวกับเรื่องอะไรบ้าง
พยายามใส่ Project ที่เคยทำในมหาวิทยาลัยด้วยเพื่อเพิ่มโอกาสในการสัมภาษณ์งาน

Skills + Certification

ใบ Certification ควรมาจากสถาบันที่คนรุ้จักเยอะก็ดีมาก
ควรเขียนสกิลที่เกี่ยวกับข้องกับบริษัทที่เราอยากสมัครงาน

Portfolio / Project

ทำ Project ที่คนทำงานมาแล้ว 1 ปี สามารถทำได้
ควรต้องเลือก Project ที่เหมาะสมกับบริษัทนั้นๆถ้าอยากย้ายงาน
ทุกอย่างที่ควรใส่ใน resume ควร signaling เรารู้มากกว่า ควรบอกคนที่สัมภาาณ์รู้ว่า สิ่งเหล่านั้นเราสามารถทำได้เหมือนกัน
หากมีอะไรอยากเพิ่มให้ลด Font โดยรวม เพื่อให้ทั้งหมดอยู่ในหน้าเดียว

ขอบคุณเนื้อหาดีๆจาก DataRockie : https://www.youtube.com/watch?v=nt60qyaqNbI

May 14, 2025

5 Topic for Intro to Data Analytics

เป็นบทความที่เขียนเพื่อที่จะช่วยให้สามารถเข้าใจวิธีการของ Data Analytic เบื้องต้น, ความแตกต่างระหว่าง AGI vs ANI ที่เป็นปัญญาประดิษฐ์ที่มีความสามารถที่แตกต่างกัน รวมทั้งรู้จักความแตกต่างของ Data Engineer vs Data Analyst vs Data Scientist ว่าทำงานแตกต่างกันอย่างไรบ้าง และ Artificial Intelligence vs Machine Learning vs Deep Learning มีการทำงานจากข้อมูลแบบไหน โดยที่สิ่งเหล่านี้จะมีประโยชน์กับการเข้าใจโลกของข้อมูลมากขึ้น

Intro to Data Analytics

What is Data Analytics

Data Analytics คือ รูปแบบของการตรวจสอบ ทำความสะอาด แปลง และสร้างแบบจำลองข้อมูล โดยมีเป้าหมายเพื่อค้นพบข้อมูลเชิงลึกที่เป็นประโยชน์

Framework

ขั้นตอนในการจัดการข้อมูลจาก Raw data > Insight เรียกว่า 4 เฟส ของการทำดาต้า

Added – Value แกนตั้งแสดงขึ้นผลตอบแทนที่เราได้กลับมา
Complexity แกนอนแสดงถึงความยากง่ายในการทำสิ่งนั้น

Type	Definition	Sample
Descriptive Analytics	What happened?	ตั้งคำถามว่าก่อนหน้านี้เกิดอะไรขึ้น เช่นในเดือนที่แล้วมีลูกค้าประจำมาสมัครสมาชิก
Diagnostic Analytics	Why it happened?	ตั้งคำถามว่าทำไมถึงเป็นแบบนั้น เช่นทำไมลูกค้าที่ทานประจำจึงมาสมัครสมาชิกในช่วงเดือนที่แล้ว
Predictive Analytics	What will happen?	ทำนายอนาคตว่าหลังจากนี้จะเป็นยังไง เช่นถ้าออกโปรโมชั่นเมนูพิเศษสำหรับบัตรสมาชิก ลูกค้าจะซื้อเมนูในโปรโมชั่นไหม
Prescriptive Analytics	What should we do about it?	ถ้าลูกค้าไม่ต่อโปรโมชั่นบัตรสมาชิกเราควรจะทำยังไงถึงจะดึงดูดให้ต่ออายุบัตรสมาชิก

ต้องทำให้ลูกค้ากลับมาดู Netflix , Disney โดยนำเข้า ซีรี่ย์ ใหม่เรื่อยๆ เพื่อรักษาลูกค้าไว้

ในทุกๆโปรเจคเราต้องตั้งคำถามใน 4 framework

AI ณ ปัจจุบันมีโอกาสผิดพลาดได้ จึงจะต้องมี framework ที่ 5 เพิ่มเข้ามาเพื่อให้ดีและละเอียดยิ่งขึ้น

Cognitive Analytics

การวิเคราะห์ความรู้ความเข้าใจที่ใช้ในการจำลองความสามารถในการคิดของมนุษย์
การวิเคราะห์เชิงทำนาย เป็นการวิเคราะห์ข้อมูลในอดีตเพื่อหารูปแบบความสัมพันธ์ในชุดข้อมูลที่สามารถนำมาเป็นต้นแบบในการทำนาย การคาดการณ์ ผลหรือสิ่งที่น่าจะเกิดขึ้นในอนาคต

AGI vs. ANI

Artificial generative Intelligence vs Artificial Narrow Intelligence / Future vs Present

Quality	AGI	ANI
Description	ปัญญาประดิษฐ์ที่มีความสามารถทั่วไปใกล้เคียงกับมนุษย์ ทำงานได้หลายด้าน	ปัญญาประดิษฐ์ที่ออกแบบมาเฉพาะด้าน ทำงานได้เฉพาะงานหรือปัญหาหนึ่งๆ
Flow work	สามารถปรับตัวและเรียนรู้การทำงานใหม่ๆ ได้โดยไม่จำเป็นต้องตั้งโปรแกรมเพิ่ม	ถูกตั้งโปรแกรมมาเพื่อทำงานเฉพาะอย่าง และไม่สามารถทำงานนอกเหนือจากนั้นได้
Example	การเข้าใจภาษาธรรมชาติในหลายภาษา, การแก้ปัญหาที่ซับซ้อนแบบมนุษย์	ระบบแนะนำสินค้า, การรู้จำใบหน้า, การประมวลผลภาพ

Multimodal AI ที่เข้าใจ content หลายประเภทมากกว่าเดิม

Case : Open AI

Open AI แต่ละชนิดก็จะมีความเชี่ยวชาญในแต่ละด้านดังตารางด้านล่างนี้ครับ

Program	Expert
ChatGPT	Language
Dalle	Picture
Solar	Video
Whisper	Audio

ChatGPT can make mistakes. Check important info.

AI สามารถดูวิดีโอได้เยอะกว่ามนุษย์ สามารถ process ข้อมูลได้ดีกว่า มนุษย์ หา pattern ดีกว่า

Data Engineer vs Data Analyst vs Data Scientist

คุณสมบัติ	Data Engineer	Data Analyst	Data Scientist
บทบาทหลัก	ออกแบบ สร้าง และดูแลโครงสร้างพื้นฐานการจัดเก็บข้อมูล	วิเคราะห์และตีความข้อมูลเพื่อสร้างรายงานและแนะนำการตัดสินใจ	สร้างแบบจำลองเชิงคาดการณ์และวิเคราะห์ข้อมูลเชิงลึก
เป้าหมาย	สร้างระบบที่มีประสิทธิภาพ	ทำความเข้าใจแนวโน้มและให้ข้อมูลเชิงลึกสำหรับธุรกิจ	สร้างโมเดลเพื่อคาดการณ์และแก้ปัญหาทางธุรกิจที่ซับซ้อน
การใช้เครื่องมือ	SQL, Apache Spark, Hadoop, Airflow, NoSQL	Excel, SQL, Power BI, Tableau, Google Analytics	Python, R, TensorFlow, PyTorch, Jupyter Notebook
ทักษะที่จำเป็น	การเขียนโปรแกรม, การจัดการฐานข้อมูล, การพัฒนา ETL, ความรู้ด้าน Cloud	การวิเคราะห์ข้อมูล, การสร้างรายงาน, การใช้เครื่องมือ BI	การเขียนโปรแกรม, การสร้างแบบจำลอง, การใช้ Machine Learning

ส่วนใหญ่ บริษัทคาดหวังว่าอะไรพนักงานจะมี Skill เยอะๆ แล้วทำได้หลายอย่าง

Type	Example
Unstructured Data	อีเมล, PDF, โพสต์บนโซเชียลมีเดีย
Structured Data	ตาราง Excel, ข้อมูลที่ใช้ในฐานข้อมูล SQL

Artificial Intelligence vs Machine Learning vs Deep Learning

Type	Artificial Intelligence (AI)	Machine Learning (ML)	Deep Learning (DL)
Description	ระบบที่สามารถทำงานหรือคิดอย่างฉลาดเหมือนมนุษย์	สาขาหนึ่งของ AI ที่เน้นการเรียนรู้จากข้อมูล	สาขาหนึ่งของ ML ที่ใช้โครงข่ายประสาทเทียมในการเรียนรู้
Objective	สร้างระบบที่มีความสามารถในการประมวลผลข้อมูลและตัดสินใจ	สร้างแบบจำลองที่สามารถคาดการณ์หรือจำแนกประเภทข้อมูลได้	การเรียนรู้คุณลักษณะอัตโนมัติและการจัดการกับข้อมูลที่ไม่มีโครงสร้าง
Technique	การประมวลผลภาษาธรรมชาติ (NLP), การรับรู้เสียง, ระบบอัจฉริยะ	อัลกอริธึมเช่น Linear Regression, Decision Trees, SVM	โครงข่ายประสาทเทียม เช่น CNN, RNN, LSTM
Data	สามารถทำงานกับข้อมูลที่มีโครงสร้างและไม่มีโครงสร้าง	ต้องการข้อมูลจำนวนมากเพื่อการเรียนรู้ที่มีประสิทธิภาพ	ต้องการข้อมูลจำนวนมากและข้อมูลที่มีความหลากหลายเพื่อผลลัพธ์ที่ดีที่สุด

AI คือโลกใหญ่ Machine Learning คือเทคนิคที่ทำให้เราไปถึง AI ได้ Deep learning คือ Algorithm ที่เก่งมาก

Another How to find Case Studies in Chatbot

โดยเราสามารถเรียนรู้เกี่ยวกับเคสเพิ่มเติมของ data science เพิ่มได้จากการพิมพ์ถามจาก ChatGPT หรือ Gemini เพื่อช่วยที่จะเรียนรู้ได้

Prompt : I want to learn more about data science case studies.

can you help me find popular case studies in data science and data analytics. at least 2 case study

โดยสรุปแล้ว ความสำคัญของบทความนี้ได้ว่า การเข้าใจกระบวนการ Data Analytic จะช่วยให้เข้าใจเกี่ยวกับ Role อาชีพของ Data และการทำงานของ Machine Learning หรือ AI ด้วยครับ หวังว่าจะได้ผู้อ่านทุกท่านจะได้ประโยชน์จากการอ่านบทความนี้ไม่มากก็น้อยครับ ขอบคุณครับ

ขอบคุณเนื้อหาดีๆจาก Data Science Bootcamp 11 : https://data-science-bootcamp1.teachable.com/courses/data-science-bootcamp-11/lectures/60407535

April 22, 2025

10 Ideas from Google Sheets 101 เพื่อให้สามารถจัดการข้อมูลให้เป็นระเบียบมากขึ้น

Teached by Kasidis Satangmongkol (DataRockie Owner)

💕💕💕 หลังจากได้เรียน Google Sheets 101 จาก Data Science Bootcamp 11 จึงอยากสรุปความรู้เป็นบทความเพื่อพัฒนาความเข้าใจให้มากขึ้นสำหรับ Google Sheet คือโปรแกรม Spreadsheet ที่ใช้งานได้ฟรี ทำให้สามารถสร้าง แก้ไข และทำงานร่วมกับผู้อื่นได้แบบ Real-time

Why Google Sheets?

ปกติควรใช้ Spreadsheets tool ในการเก็บ วิเคราะห์ และนำเสนอข้อมูล

Spreadsheet Method	Definition
Store	เก็บรวมข้อมูล (Structure Data)
Analyze	ทำความสะอาดและวิเคราะห์ข้อมูล
Present	นำเสนอ insights ที่มีประโยชน์ต่อแบรนด์/ เรื่องที่เราสนใจ

Data Types

Type of Data

Data Type	Definition
Numeric	ตัวเลข
String	ตัวอักษร
Boolean	True or False
Date	วันที่

Foundation Date : YYYY-MM-DD

Type of Category Data

Type	Definition
Dimension	ประเภทของข้อมูลที่เป็น Category
Measurement	ระเภทของข้อมูลที่เป็น Numeric

Explain the different between dimension and measurement in a table

Punctuation

Type	Punctuation	Definition
Double quote	” “	use for message
Ampersand	&	to connect 2 message

Array Formula

มีไว้กรณีสามารถพิมพ์คำสั่งครั้งเดียวแล้วได้ผลลัพธ์ หลายๆค่าพร้อมกัน

Arrayformula Shortcut : CTRL + SHIFT + ENTER

Type	Definition	Punctation
Horizontal Structure	ข้อมูลเรียงแนวนอน	,
Vertical Structure	ข้อมูลเรียงแนวตั้ง	;
Metrix Structure	ข้อมูลเรียงแนวนอนก่อน แล้วแสดงผลแนวตั้งต่อ	, ;

Horizontal Structure

A7={1, 2, 3, 4, 5}

--------------------------------
จะแสดงผลในแนวนอนดังนี้ 1 2 3 4 5

Vertical Structure

A9={"Toy"; "Anna"; "John"; "David"}
--------------------------------
Toy แสดงผลในแนวตั้ง
Anna
John
David

Matrix Structure

={100, 200; 300, 400; 500, 600} 
----------------------- 
100 200
300 400
500 600 
แสดงผลในแนวนอนก่อน แล้วหลังจากจะแสดงผลแนวตั้งต่อจากแนวนอน

Results from Three Types of Array Formulas

Cell References

การ lock cell สามารถทำได้โดยการกด Press F4 to lock cell

Shortcut Type	Definition
A1	ไม่มีการ lock cell ใดๆ เลย
$A$1	Absolute formula จะไม่มีการเลื่อนแถวเลย
A$1	ปล่อยฟรี column A lock only row 1
$A1	lock only column A ปล่อยฟรี row 1

Method by I2 = %percent ที่เงินเดือนของพนักงานแต่ละคนจะเพิ่มขึ้น

Function

หน้าที่ของ Function การเปลี่ยน input ให้กลายเป็น output

function_name(arg1,arg2,[arg3],[arg4])
Required Argument
Optional Argument

Type	Definition
Require Argument	ตัวแปรที่จำเป็น ต้องมีสูตรถึงจะใช้ได้
Optional Argument	ตัวแปรที่ไม่จำเป็นต้องมี เป็นแค่ ตัวแปรเสริม

And or Not

Array Function ไม่สามารถใช้กับ AND, OR, NOT

Type	Definition
AND	จะแสดงค่าจริงต่อเมื่อ 2 เงื่อนไขเป็นจริง
OR	จะแสดงค่าจริงต่อเมื่อ 1 เงื่อนไขเป็นจริง
NOT	จะแสดงค่าไม่จริง

Conditions

Type	Condition
IF	แสดงผลลัพธ์แบบ 1 เงือนไข
IFS	แสดงผลลัพธ์ IF ซ้อน IF
NESTED IFS	ผลลัพธ์แบบมี IFS ตัวเดียวโดยทำอ่านง่าย
SWITCH	ใส่ Range Condition แค่ 1 รอบง่าย

Condition term

Salary	Segment
≥100K	High
≥50K	Medium
Else	Low

IF

IF(F2:F6>=100000,"High","Low")

NESTED IF

ArrayFormula(IF(F2:F6>=100000,"High",IF(F2:F6>=50000,"Medium","Low")))

IFS

ArrayFormula(IFS(F2:F6>=100000,"High",
F2:F6>=50000,"Medium",
F2:F6<50000,"Low"))

NESTED IF vs IFS

Quality	NESTED IF	IFS
Pattern	IF ซ้อน IF	สูตรสั้นกว่า
Approach	สูตรอ่านยากเมื่อมีหลายเงื่อนไข	อ่านสูตรง่ายกว่า

SEGMENT : ถ้า column F ≥ 100,000 ให้แสดง “High”, Column F ≥ 50,000 ให้แสดง “Medium” นอกนั้นแสดง “Low”
SEGMENT (IFS) : จะแยกเงื่อนไขพร้อมกันทีเดียวเลย

SWITCH

สามารถหาความสัมพันธ์ระหว่าง 2 ตาราง แล้วใส่ Range Condition แค่ 1 รอบทำให้สูตรสามารถอ่านได้ง่าย

SWITCH vs IFS

Type	Range
SWITCH	มี Range Condition 1 รอบ Column G
IFS	มี Range Condition หลายรอบ Column G

---SWITCH---
ArrayFormula(SWITCH(G2:G6,
1, "London",
2, "Madrid",
3, "Tokyo"))
---IFS----
ArrayFormula(IFS(G2:G6=1,"London",
G2:G6=2,"Madrid",
G2:G6=3,"Tokyo"))

SWITCH หาก column G = 1 show “London”, = 2 show “Madrid”,= 3 show “Tokyo”

Aggregate Function

Aggregate Function	Definition
SUM	ผลรวม
AVG	ค่าเฉลี่ย
MIN	ค่าต่ำสุด
MAX	ค่าสูงสุด
COUNTA	นับจำนวน cell ที่มีข้อมูล
COUNT	นับจำนวน cell ที่เป็นตัวเลข
MEDIAN	ค่ากลาง

Aggregate function with Conditions

COUNTIFS

Countifs

Type	Definition
Criteria_range1	ช่วงที่ใช้ในการหานับค่า
Criterion	ตัวเลขหรือข้อความที่ต้องการหานับค่า

Requirement : เราสามารถสรุปผลของข้อมูลเบื้องต้นโดยการนับจำนวน Brand หรือ Model ที่เราต้องการ

G3=COUNTIFS(C:C,"AMD",D:D,"Ryzen 5")       
ค้นหา "Brand AMD" ที่อยู่ Column C และ "Model Ryzen 5" ที่อยู่ Column D
G4=COUNTIFS(C:C,"AMD",E:E,">300")         
ค้นหา "Brand AMD" ที่อยู่ Column C และ Column E ที่มี "Quantity > 300"
G5=COUNTIFS(C:C,"AMD",B:B,"<2021-06-05")  
ค้นหา "Brand AMD" ที่อยู่ Column C และ Column B ที่มี "Date < 2021-06-05"
G6=COUNTIFS(C:C,"AMD",B:B,">=2021-06-06") 
ค้นหา "Brand AMD" ที่อยู่ Column C และ Column B ที่มี "Date >= 2021-06-06"
G7=COUNTIFS(C:C,"AMD",B:B,"<2021-06-06")  
ค้นหา "Brand AMD" ที่อยู่ Column C และ Column B ที่มี "Date < 2021-06-06"

SUMIFS

Type	Definition
Criteria_range1	ช่วงที่ใช้ในการหารวมผลลัพธ์
Criterion	ตัวเลขหรือข้อความที่ต้องการหารวมผลลัพธ์

Requirement : เราสามารถ หาผลรวม Brand and Model ว่ามี Quantity เท่าไร

G3=SUMIFS(E:E, C:C, "AMD", D:D, "Ryzen 5")   
ค้นหาช่วง Column E ที่มี "Brand AMD" อยู่ Column C และ "Model Ryzen 5" ที่อยู่ Column D 
G4=SUMIFS(E:E, C:C, "inteL", D:D, "Core i5") 
ค้นหาช่วง Column E ที่มี "Brand Intel" อยู่ Column C และ "Model Core i5" ที่อยู่ Column D
G5=SUMIFS(E:E, C:C, "inteL", D:D, "Core i7") 
ค้นหาช่วง Column E ที่มี "Brand Intel" อยู่ Column C และ "Model RCore i7" ที่อยู่ Column D

ตัวอักษรตัวเล็กกับตัวใหญ่ สามารถค้นหาได้เหมือนกัน ต้องระวัง

Filter and Sort

Filter

Filter เพื่อดึงเฉพาะข้อมูลที่ต้องการนำมาใช้ในการทำงาน “โดยดึงมาทีเดียวพร้อมกันทุก Column”

Filter by hand

press filter in google sheet เลือก Filter by Condition —> Date is before —> Exact Date —> “2021-06-06”

Filter with and, one condition, or

Filter with brand “AMD” only
=FILTER(A3:E22,C3:C22="AMD") 
เลือกช่วงทั้งหมด พร้อม Column Brand ที่มี "AMD"
Filter AND
=FILTER(A3:E22,C3:C22="AMD",D3:D22="Ryzen 5") 
เลือกช่วงทั้งหมด พร้อม Column Brand ที่มี "AMD" และ Column Model "Ryzen 5"
Filter OR
=FILTER(A3:E22, (C3:C22="AMD") + (D3:D22="Core i7")) 
เลือกช่วงทั้งหมด พร้อม Column Brand ที่มี "AMD" หรือ Column Model "Ryzen 5"

SORT

SORT FORMULA =SORT(range, sort_column, is_ascending)

is_ascending = True or False

Type	Order
FALSE	เรียงจาก มากไปน้อย
TRUE	เรียงจาก น้อยไปมาก

Sort with one condition
=SORT(A3:E22,3,TRUE)
เรียงข้อมูล Column ที่ A3 ถึง E22 โดยที่ Column Model เป็น "Column ที่ 3" ซึ่งเรียงจาก"มากไปน้อย"
Sort with two condition
=SORT(A3:E22,3,TRUE,5,FALSE)
เรียงข้อมูล Column ที่ A3 ถึง E22 โดยที่ Column Brand เป็น Column ที่ 3 ซึ่งเรียงอักษรจาก"น้อยไปมาก" และ Column Quantity เป็น Column ที่ 5 ซึ่งเรียงจาก"มากไปน้อย"

หวังว่า บทความนี้จะเป็นประโยชน์ในการช่วยให้ทุกท่านสามารถนำความรู้จาก Google sheet 101 ในไปประยุกต์ใช้ในชีวิตส่วนตัวและชีวิตการทำงานที่สามารถทำให้ทุกท่านทำงานได้รวดเร็วและสะดวกยิ่งขึ้นครับ

January 18, 2025

10 Ideas from Data Science Bootcamp batch 11 Live 01 ที่นอกจากสอน data แล้ว ยังสอนวิธีการเรียนรู้ที่ดีทำให้เปลี่ยน Mindset ให้ดีขึ้นได้ด้วย

Teached by Kasidis Satangmongkol (DataRockie Owner).

✅✅✅การที่มีโอกาสลงเรียน Data Science Bootcamp batch 11 ทำได้รับ mindset ใหม่ๆ ที่ช่วยให้เรามีความมุ่งมั่นเพื่อให้ทำสิ่งต่างๆ สำเร็จ (manifestation) มี W3Schools ให้ศึกษาหาความรู้ดีๆ ได้ตลอดเวลา ขอบคุณสำหรับโอกาสในการเรียนรู้ครับ แอดทอย 😘😘😘

Table of Content

General Bootcamp

อธิบายการเรียน bootcamp เบื้องต้น
โรงเรียนนี้พยายามจะเปลี่ยนจาก video based เป็น text-based ประมาณ 30% :ซึ่งทำให้สะดวกต่อการเรียนรู้ (No AI help)
ปัจจุบันคนเราสามารถเขียนภาษาได้เข้าใจง่ายกว่า AI เขียน ขนลุก 😍😍😍

W3Schools Academy

สำหรับการเรียน Data หรือ Coding ถ้าจะไม่พูดถึง W3Schools ก็ไม่ได้ เนื่องจากเป็นแหล่งเรียนรู้ที่คนยอมรับและนิยมเข้าไปศึกษาหาความรู้เพิ่มเติม หากเราไม่รู้จัก Function ของโปรแกรม R, SQL, Python ในการเขียนโปรแกรมก็สามารถเข้าไปหาอ่านได้ใน W3school ผู้เรียนจะมีโอกาสได้ทดลองเขียนโปรแกรมตามคำแนะนำ พร้อมคำอธิบายละเอียดเมื่อเกิดข้อผิดพลาด ทำให้เข้าใจเนื้อหาได้ง่าย❤️❤️❤️แต่การเรียน Data Science Bootcamp batch 11 สามารถมีโอกาสสอบ W3School

Website : https://www.w3schools.com/

Highlight of this class : ได้เรียนเนื้อหา W3School ทำ partnership ด้วยกัน

Language Skills

Language Skills มีความสำคัญอย่างมาก เพราะเป็นเครื่องมือหลักในการสื่อสาร สร้างความเข้าใจระหว่างบุคคล และเป็นพื้นฐานสำคัญในการเรียนรู้ โดยเฉพาะ Skill Writing และ Reading

Language Skills	Words per minute
Reading	200-250
Speaking	120-150
Listening	40-60
Writing	120-150

คนเขียนจะใช้เวลานาน แต่อ่านจะใช้เวลาเร็วดังนั้น Writing and Reading is Metaskill

5 books recommend this course

These books can be ordered on Amazon.

Book	Author	Skill	Definition
How to be better at (almost everything)	Pat Flynn	Skills	Pat Flynn is generalist can a lot of create music, fitness, write a book, do business and create content
Naked Statistics	Charles Wheelan	Rationale	Charles wheelan is economist but can write statistic to make another people understand
Think like A Freak	Stephen J. Dubner and Steven Levitt	Thinking	how to ask question
Business Made Simple	Donald miller	Business	to create write and business
It doesn’t have to be crazy at work	David Heinemeier Hansson and Jason Fried	Work	how to work from basecamp company

to understand statistic you must read english book.

Education

คนทำ Data จะรู้เรื่องเดียวไม่ได้ จำเป็นต้องรู้หลายเรื่องเช่น business, marketing เพื่อที่จะขึ้นตำแหน่งได้เรื่อยๆ
การเรียนรู้เป็นสิ่งที่ไม่มีวันสิ้นสุด และสามารถเก่งขึ้นได้เรื่อยๆ

How to Learn

วิชาที่ทุกคนจะได้เรียน ทุกคนไม่ต้องเลือกเพราะ “แอดทอยเลือกให้หมดแล้ว“
สิ่งที่เปลี่ยนแอดทอยได้คือ mindset ที่จะทำเพื่อพ่อ แม่ คือความขยัน
ดังนั้นคนเราควรพยายามคิดแบบ deep thinking ให้เยอะ เพื่อเพิ่มโอกาสในการอยู่รอดในอนาคตได้

Class	Day to learn
Main Class	Saturday
Support Class	Sunday
On demand Class	Weekday

Good Learn

Good sleep
Good Food
Good Exercise
Drink a lot of water

Good Mindset

ไม่ควรอ่านหนังสือบนเตียงนอนเพราะไม่งั้นจะง่วงนอน
คนรู้ว่าจะเรียนแล้วจะดี แต่คนมักไม่ยอมเรียน 555
สิ่งที่แรกที่ควรทำตอนเช้า อ่านหนังสือ อ่านบทความ ออกกำลังกาย take course เขียน content

ควรเรียน 1 ชมไปเลย ดีกว่าการแบ่งทีละ 15 นาที 4 ครั้ง
ต้องพยายามอย่าโดน Disrupt ในการเรียน

Research skills

Method Research สามารถเรียนรู้ด้วยตัวเอง Research ข้อมูลและเขียนขึ้นมา
Solving Problem Yourself

Skill

Google is your friend
แต่หาใน google ไม่เจอสามารถมาถามใน discord ได้
Answer are already in the internet

ทักษะการ search เป็นทักษะที่สำคัญมากในการเรียนรู้ได้ตัวเอง

Our Curriculum

Curriculum

รุ่นแรกๆ มีนักเรียน 200-300 คนแล้วเรียนผ่าน notion
ทุกสัปดาห์นักเรียนจะได้สกิลใหม่ๆ เพื่อพัฒนาไปเรื่อยๆ ทีละสัปดาห์เป็น Sprint
การเรียนแบบเป็น Agile จะช่วยจับจังหวะการเรียนรู้ได้ดี การทำงานที่รวดเร็วว่องไว เน้นผลลัพธ์ที่มีประสิทธิภาพมากกว่าขั้นตอน เช่น 1.Week 1 เรียน Welcome to The Course bootcamp 2.Week 2 เรียน Build a second brain ทบทวน Week 1

การทำแบบนี้ไปเรื่อยๆ จะช่วยให้สามารถจำเนื้อหาและประยุกต์การใช้ทำงานจริงได้

Plan Study with Sprint and Agile

You must break step of learning into Sprint.
มนุษย์เราสามารถจัดการเรียนในแต่ละสัปดาห์ๆ เพิ่มให้เราเก่งขึ้น Skill ละ 1 week
Agile จะเป็นการเรียนรู้จากเมื่อวานที่ดีขึ้นเรื่อยๆ

This school create product that is yourself can design by add skill to have add value.

Plan for learning

โดยอาจจะมีกิจกรรมดีๆ เพิ่มเวลาเรียนจากสิ้นสุดเดือนพฤษภาคมเป็นเดือนมิถุนายน

ถ้าแบ่งเวลาวันละ 40-60 นาทีก็สามารถเรียนจบได้เลย ถ้าเราเรียนรู้ทุกวัน

Learning Journey

Learning path

ไม่เรียนข้าม sprint
เรียน on-demand ก่อนเข้า
ทำการบ้าน
ทบทวนบทเรียนและ take note

Month	Learning Path
Jan 2025	Spreadsheets + Databases
Feb 2025	Programming +Data Transformation
March 2025	Statistics + Business Analytics + ML
Apr 2025	Python + Dashboard

Class Support

มักจะมี class เสริมในวันอาทิตย์โดยส่วนใหญ่
class เสริมจะช่วยให้นักเรียนได้รู้เกี่ยวกับความรู้อื่นๆที่มีประโยชน์ Data Scientist, Data Engineer, Business Analyst

Rule of Bootcamp

Rule No. 1 Skills in combination are more powerful than individual skills.

รู้เนื้อหากว้างในทุกด้านให้มากกว่าค่าเฉลี่ย แต่ไม่จำเป็นต้องเก่งที่สุดทุกเรื่อง

Pat Flynn is generalist can a lot of create music, fitness, write a book, do business and create content.
ต้องวิ่งและเล่นกล้ามพร้อมกัน ดีกว่าเลือกทำอย่างใดอย่างนึงมากกว่า เช่น วิ่งอย่างเดียว

Generalist > Specialist

Rule No. 2 Never beyond 80%

Diminishing Returns เป็นหนึ่งในกฎเชิงเศรษฐศาสตร์ ซึ่งเราเรียกกันว่า กฎแห่งผลตอบแทนที่ลดลง (Law of Diminishing)

Diminishing Returns Case

หากทำได้ 80% แล้วควรไปเรียนวิชาอื่นต่อดีกว่า
อยากกินพิซซ่ามาก พอกินชิ้นแรกอร่อยมาก แต่กินชิ้นที่ 7-8 เริ่มอิ่มแล้วไม่คุ้มที่จะกินแล้ว
ถ้าเรียนไปแล้ว 1 ชม แล้วได้ผลตอบแทนไม่คุ้มกับที่เรียน ควรที่จะไปเรียนวิชาอื่นเลย

ทรัพยากรที่ยิ่งใหญ่ที่สุดในชีวิตมนุษย์คือเวลา

Define Data Analyst, Data Scientist, Data Engineer

คนเราเป็น data anlyst อยู่แล้ว ขึ้นอยู่กับแต่ละคนมี Skill ในตัวเองกี่ %
คนทั่วไป 10% แอดทอยอาจจะเป็น 70% แต่ขอให้เองกว่าคนทั่วไป
น้องชายแอดทอยเป็นนักกฎหมายใช้ Data analyst แบบใช้ unstructure data

Type	Definition	Skill
Structured Data	ข้อมูลที่จัดระเบียบ	Spreadsheet , Table
Unstructured Data	ข้อมูลที่ไม่มีรูปแบบ	Image , Texts
Semi-structured Data	ข้อมูลที่มีโครงสร้างบางส่วน	Emails

Job	Definition	Tool
Data Analyst	วิเคราะห์ข้อมูลที่มีอยู่เพื่อหา Insight ในการตัดสินใจทางธุรกิจ	Spreadsheets+SQL+ Dashboard
Data Scientist	สร้างแบบจำลองทางสถิติ เพื่อทำนายผลลัพธ์หรือค้นหา Pattern	Python+Statistics+ML
Data Engineer	สร้างและบำรุงรักษาระบบโครงสร้างพื้นฐานของข้อมูล เพื่อให้ข้อมูลพร้อมใช้งาน	Software+Cloud+ Java/Scala

Data Analyst vs Data Scientist vs Data Engineer

Data Analyst in bootcamp

Data Analyst is Essential Skill not sexy skills
Skill ด้านล่างคือสิ่งโรงเรียน bootcamp จะได้เรียนทั้งหมด

Model 4 M Limitless

Jim Kwik, author of Limitless

Model 4 M	Definition	Sample
Mindset	ความเชื่อและมุมมองที่ส่งผลต่อวิธีการคิดและการตัดสินใจ ควรมีแนวคิดที่ยืดหยุ่นและพร้อมพัฒนา	I live, I learn, and I repeat the cycle.
Motivation	แรงผลักดันที่ช่วยให้เราเริ่มต้นและมุ่งมั่นในการทำสิ่งต่างๆ	Learning new skills enhances one’s quality of life.
Method	แผนและกระบวนการที่ใช้ในการบรรลุเป้าหมาย ควรมีวิธีการที่เป็นระบบ	Wake up early, then study, then write, and then share what you learn.
Momentum	การทำอย่างสม่ำเสมอและต่อเนื่องเพื่อสร้างนิสัย	do it every day

เราสามารถทำอะไรก็ได้ ไปทุกวันจนกลายเป็น momentum ได้ง่าย
ทำให้การเข้า live วันเสาร์เป็นเรื่องธรรมดา เพื่อที่จะสามารถเรียนรู้ไปเรื่อยๆ
ต้องพูดภาษาอังกฤษได้

Build Your Presence Online

ต้องมี LinkedIn ไว้เพื่อสมัครงาน

ใช้ Dataset สำหรับทดลองสร้าง project ของตัวเอง :

Type of Port	Definition
Portfolio	ผลงานที่สิ่งที่เราอยากจะนำเสนอให้โลกได้เข้าใจ หรือนำเสนอการวิเคราะห์ข้อมูลให้บริษัทที่เราจะไปสัมภาษณ์ได้เข้าใจว่าเราทำอะไรเป็นบ้าง
Github	เว็บไซต์สำหรับเขียน code
Website	สร้างเว็บไซต์ของตัวเอง เช่น WordPress
Blogs	เขียน Blogs เช่น Medium

Important things for applying for work

แล้วเขียน content ให้คนดูเยอะๆ ถึงจะมีโอกาสได้งานเยอะขึ้น
โรงเรียนนี้ เน้นการสร้าง project ขึ้นมาเพื่อแสดงว่าเราทำงานได้จริง
การบ้านเป็นสิ่งที่ทุกคนได้ทำแล้วได้เน้นในการลงมือทำ
ถ้าเป็นเด็กจบใหม่ แค่มี portfolio ก็มีโอกาสได้งานแล้ว
ต้องมี linkedIn ไว้เพื่อสมัครงาน

เรียนจบอะไรไม่สำคัญ สำคัญที่ปัจจุบันมี manifestation เพื่อที่จะเก่งขึ้นก็พอ

บทความนี้จัดทำขึ้นโดยหวังว่าจะเป็นประโยชน์แก่ผู้อ่านในด้าน Mindset และ Data เพื่อให้ได้รับความรู้และไอเดียที่นำไปปรับใช้ได้

https://data-science-bootcamp1.teachable.com/p/data-science-bootcamp-11

January 6, 2025

10 Method for Power BI to build your reports and dashboard
This project explores the use of Power BI to create dashboards that provide insights for improved business planning and decision-making within organizations. It also demonstrates how structured data can be presented in a clear and accessible manner through data storytelling.

Table of Content
References, Resources & Files

This project contains the following three files:

Data Files

data Download
- This is an Excel file containing cookie shipping data that use to create reports have 2 sheets.
- Sheet 1, named “Orders” shows the following data.
1. Order ID : The cookie shipping process based on each order.
2. Customer ID : The order of 5 customer IDs connected to the “Customers” sheet.
3. Rush Shipment : These shipments send express or regular.
4. Cookies Shipped : These are cookies that have been shipped to customers.
5. Revenue : Revenue from cookie sales
6. Cost : Costs of cookie sales
7. Order date : The date of shipment
- Sheet 2, named “Customers” shows details of customers who purchased products through 5 specific customers.
1. Customer ID : Ordering of 5 customers with ID numbers linked to the “Order” sheet.
2. Customer name : Show the information of the shipping company.
3. Phone : The company phone number.
4. Address : The company address
5. City : City name
6. State : State name
7. Zip : Zip Number
8. Country : Country of shipment
The data files contain raw, uncleaned data.

Practice Files

practice Download
- A dataset of chocolate product information for a company that use to create reports have 2 sheets.
- Sheet 1, named “Orders” contains the following data:
1. Order ID :The chocolate chip product distribution process based on orders.
2. Order_date : The chocolate chip product shipping dates by order.
3. Retail_number : List of retailers connected to the “Retailers” sheet.
4. Quantity : The quantity of the product.
5. Unit Price : Price per unit.
6. Total Amount : Quantity * Unit Price
7. Product : Chocolate Chip
- “Sheet 2, named ‘Retailer_Names‘, shows the following data:”
1. ID : list of retailers linked to the ‘Orders’ sheet“.
2. Retailer_Name : The name of the retail company
3. Country : The countries of shipment
The practice files contain raw, uncleaned data.

Final Files

final Download
- This data file is for Emma’s coffee shop capstone task that use to create reports have 5 sheets.
- Sheet 1, named “Orders” contains the following data:
1. Order ID : Show the order list of Coffee shop
2. Customer ID :Show the customer sequence connected to the ‘Customer ID’ sheet.
3. Product ID : Show the product sequence linked to the ‘Customer ID’ sheet.
4. Quantity : The quantity of coffee.
5. Unit Price : The price of coffee per unit.
6. Order date : The date of the coffee sale.
- Sheet 2, named “Feedback” displays the following data:
1. Feedback ID : Order of displaying customer feedback and suggestions.
2. Customer ID : Show the customer order based on the “Orders” sheet.
3. Rating : Coffee Rating
4. Feedback : Text displaying customer feedback.
5. Feedback date : Date of coffee shop reviews.
- Sheet 3, named “Inventory” displays the following data:
1. Product ID : Show the product sequence from the “Orders” sheet.
2. Stock Level : Stock levels of coffee
3. Reorder Level : The stock level at which a new order should be placed.
4. Supplier : Who buys products for another person.
- Sheet 4, named “Customers ID” displays the following data:
1. Customer ID : Show the customer list connected to the “Orders” sheet.
2. Customer Name : Customer’s name of Coffee Shop.
3. Email : Email of customer’s Coffee Shop.
- Sheet 5, named “Product ID” displays the following data:
1. Product ID : Show the product list connected to the “Orders” sheet.
2. Product Name : Name of product in Coffee shop.
3. Category : Category of product in Coffee shop.
4. Price : Price of product in Coffee shop.
Navigate the power BI interface and import data
- Understand the basics of spreadsheets.
- Familiarity with data cleaning and IF/SUM functions in spreadsheets is required.
- The Microsoft account used to publish reports to Power BI.
Learning Objective
- Explore additional Power BI features like filtering publishing and graphs customization.
- Build report with visualizations
- Manage data relations and transform data with power Query
you could build a data model and start to build some visuals and add some filters to our data. Then, we’ll look on how to add a theme and share your report and dashboards with others.

Your Role : Data Analyst

You are a Data Analyst for Cookie Bliss, your mission is to create a Power BI report to help visualize key metrics more effienctly and then publish the dashboard to share it with your teammates, providing them with sales insights for better decision-making

Start Power BI
- Then, open Power BI.
Blank Report
- Then, Choose Blank Report
Visualizations Pane

The side column of this table highlighted in red above the picture has four rows call “Visualizations Pane“.

Visualizations Pane Type Definition
Report View The Dashboard Report
Table View Dataset Details
Model View Data Relationship Model
DAX Query View DAX Query Editor
- To create a dashboard, data must be selected.
Get Data –>Excel Workbook –>Select the file you want to use to create a dashboard.

Get Data

start from export file “data“
- and can add data can be exported from various common data sources.
- –> Choose file “data”
- –> Choose Customer2 table and Order1 table then click load.
Load Data

Transform data using Power Query

how to transform data before starting to build our report.
- can check the Data and Visualization areas.
Check Data and Visualization Zone
- Picture above show the Data and Visualization areas to see how each data element should be displayed.
Transform data process

Let’s start in the Table view.
1. In the Table view, the Rush Shipment column appears to be unnecessary from Order1 Table
Rush Shipment Column

2. In the Table view, the Customer2 table has a column with both “United States” and “US” as values, which should be standardized to one name.

Customer2 table with United States and US

Remove column “Rush Shipment”

—> Choose orders 1 Table —> Choose Column Rush Shipment —> click Remove

Remove Column Rush shipment
- After following the steps in the image, It will get the desired columns as follows:
- The Rush Shipment column has been removed.
Rush Shipment column has been removed

Then, clicking “Close & Apply” will result in a cleaned data table.

Replace Value
- Click Transform Data to Replace values.
Transform Data to replace values.
- In the Customer 2 table, replace all “US” values in the Country column with “United States.” and then click OK
to change US to be United States
- As a result, Country column successfully replaced value as below.
successfully replaced value

Insert and format a visualization
- learn how to visualize data from multiple tables and customize graph formatting options.
- create a line chart.
- Use the up arrow to change the data scale (day, month, and year)
Axis Column of Data File
Y-axis Sum Cookies shipped
X-axis Order Date

Line Chart

Visual
- can adjust the visual appearance of the data through the following Visual Format options.
Visual
- can edit Values.
- can choose Data label to show number in graph.
General
- can adjust the font size.
- can adjust the graph’s position.
- can adjust the graph’s color.
- can change the title.
General

Practice Data Visualization in Power BI
- You are a data analyst working with a chocolate factory and would like to help them visualize their data.
- The data is presented in the readings under “practice.xlsx” You need to import this data to Power BI, do the necessary transformation and data cleaning using Power Queries and then build the relationship betweendiffrerent columns. Finally, I need you to create a line chart that shows the evolution of sales over time.
- start from export file “practice“
export file “practice“
- upload Orders table and Retailer table —> click load as below
Orders table and Retailer table
- Choose the Retailer Table to use Replace Value.
- Change “UK” to “United Kingdom“.
Change “UK” to “United Kingdom”.

Then close and apply, and return to the dashboard.

Relationship
- To create a relationship between orders and retailers in Power BI
relationship between orders and retailers.
- drag id to retailer_number
- then click save to connect new relationship between Retailers id and Order retailer number
connect between Retailers id and Order retailer number
- Now that the data is cleaned and the relationship is created.
to create Line Chart between Order_date and Quantity

Line Chart between Retailers id and Order retailer number
- then click line chart
Axis Column Data Files
X-Axis Order date by day
Y-Axis Quantity

Now I can see the data by day of the quantity of chocolate bars sold. Now that you have a better handle of importing and preparing data into PowerBI and inserting your first visual.

Add more visuals to enhance storytelling
- show how to add additional graphs and pages to a report, as well as explore filtering options using maps, tables and bar graph.
Card Chart
- to show the total revenue that we earned here at the Cookie Bliss.
how to create card chart
- Choose card
- Choose fields as sum of revenue of Orders1 Table.
card chart with revenue

then it show revenue value

Table
- Show table of customer name
customer name
- Drag Customer name column then it show table of customer name.
Customer name Table

Then it can show the interaction between Customer Name and Revenue value on the dashboard.

Map
- Choose map and then select country column
- then it show maps of country
Map

Publish the report to the Power BI workspace
- then can change theme of dashboard by this theme.
Change theme
- add the customer id and the phone number is good choice
customer id and phone number
- then add this title name as “Cookie Bliss Sales dashboard”
Cookie Bliss Sales dashboard
- publish to Power BI —> My workspace
- When it have success It finish publish Power BI
Publish Power BI

Cumulative Activity Scenario
- then get data final files to upload and do visualization
Choose Final File

then choose all 5 tables Customers, Feedback, Inventory, Orders and Products

Products
- In Product table must clean data first
Replace Value Product table
- Value to find “Cofe” replace with “Coffee”
- to make category have Coffee and Pastry
Inventory
- In Inventory table must clean data first
Replace Value Inventory table
- Value to find “BeanWorld” replace with “Bean World”
- to make supplier have BeanWorld and PastryPro
Create Sales reports
- Create Title Sales Report
- Create Table
Create table in Sale Report
- Select Customer name and Email to create table
- Create line chart
Create line chart with order date and Quantity

Axis Column Final Files
X-Axis Order Date
Y-Axis Quantity
- Create stacked bar chart
Create stack bar chart with Category, Product Name and Quantity

Type Column Final Files
Y-Axis Category
X-Axis Sum of Quantity
Legend Product Name

to see coffee that people love eat it.

Full Sale Report

Sale Report

Create management reports
- Create Title as management reports
- Create card of average rating
card of management report
- Select Feedback table with Rating column and value with average of rating.
- Create table with customer name and emails
table with customer name and emails
- Select Column as customer name and Email
- Create stack bar chart
Create stack bar chart with category and Stock level

Axis Column Final Files
Y-Axis Category
X-Axis Stock Level
- Create clustered column chart
clustered column chart with Reorder Level and Supplier

Axis Column Final Files
Y-Axis Count of Reorder level
X-Axis Supplier

Full Management Reports

Management Report

Summary

I hope this project helps users create dashboards in Power BI, improve their proficiency with the tool, and learn effective techniques for using titles, colors, and themes.

Coursera : https://www.coursera.org/account/accomplishments/verify/YWAXF815EXNK
January 1, 2025

5 Steps with Project introduction SQL

SQL (Structured Query Language) is a powerful programming language used for managing and manipulating relational databases. It allows users to create, retrieve, update, and delete data efficiently within a database system. SQL is widely used across industries for tasks ranging from data analysis to database management.

5 Method Of Introduction SQL

1. Database

2. Tables

3. SQL Data Types

4. Introducing Queries

5. Writing Queries

Table of Content

Relational Database

A relational database is a type of database that organizes data into structured tables (relations) with rows and columns.

Database

purpose of introduction with SQL

Understand databases and their structure
Extract Information from databases using SQL

Table show relation patrons, books and checkouts

Patrons Table

Column (field name)	Definition
card_num	card number
name	name
member_year	the year the patron became a library member.
total_fine	the total overdue

Relational Database – -> relation between tables of data insider the database

Database Benefits

Database have more storage than spreadsheet application.
Many users can write queries to gather insights from the data at the same time.

when a database is queried, the data stored insider the database not change.

Tables

Definition

databases are organized into tables, which hold related data about a particular subject.
tables are organized into rows and columns.
in the world of databases, rows are often referred to as records and columns as fields.

relation between patrons table and checkouts table connect with card_num column,
book table and checkouts table connect with id column.

Create Table Name

lowercase
no space and – in table name (use underscores instead)
plurals

Record and Field

Records

Laying the table : records
A record is a row that holds data on an individual observation.

Fields

Laying the table : fields
A field is a column that holds one columns of data for all records.

Table manner

Qualification

Singular name

No lowercase

No space

be different from other field name

be different from the table name

Assigned seats

A unique identifier is used to identify records in a table.
Distinct and often number.

Create books table

A database has been set up for this course and the books table is available here.

Run the code to explore what data books holds!.

SELECT * FROM books;

SQL data types

When a table is created, a data type must be indicated for each field. The data type is chosen based on the type of data that the field will hold a text and number.

SQL Data type	Attribute
String	letters or punctuation
Integers	whole number
Floats	fractional number

String

String is a sequence of characters such as letters or punctuation.
VARCHAR is a flexible and popular string data type in SQL.

Integers

Integers is whole number
INT is popular integer data type in SQL.

Floats

Float store numbers that include a fractional part
NUMERIC is popular float data type in SQL

Schema

A schema shows a database’s design, such as what tables are included in the database and any relationships between its tables.

Querying

Introducing queries

Benefits of SQL

use SQL to find which books James checked out from the library in 2022.

relation between card_num **checkouts** tables and **patrons** table.

use SQL queries to uncover trends in website traffic, customer reviews, and product sales.

Question

Which products had the highest sales last week?

Which products get the worst review scores from customers?

How did website traffic change when a feature was introduced?

Keyword

Keyword is word for operations. Common keywords : SELECT, FROM

The SELECT keyword indicates which fields should be selected

The FROM keyword indicates the table in which these fields are located

SELECT name FROM patrons;

Selecting multiple fields

Can select field to that want show data example card_num and name

SELECT card_num, name FROM patrons;

It will show field that select first as picture below.

SELECT **card_num, name** vs SELECT **name, card_name**

Selecting all fields

if you want to show all data use asterisk(*) to select all four field name.

SELECT * FROM patrons;

Writing queries

Aliasing (Rename Column)

Use aliasing to rename column.

Use SELECT name AS first_name to change field name from name to be first_name.

SELECT name AS first_name, year hired
FROM employees;

Selecting Distinct Records

if you select year_hired it will show result duplicate year 2020 and 2021
we can add the DISTINCT keyword before the year_hired that make data show 4 year distinct.

SELECT DISTINCT year_hired
FROM employees;

SELECT **DISTINCT** year_hired FROM employees;

Distinct with multiple fields

add the DISTINCT keyword before the fields to select
the department id and year_hired fields still have repeat values individually, but none of the records are the same

SELECT DISTINCT dept_id, year_hired
FROM employees;

SELECT **DISTINCT** dept_id, year_hired FROM employees;

Views

A view is a virtual table that save SQL SELECT statement
When accessed, views automatically update in response to updates in the underlying data.

CREATE VIEW employee_hires_years AS
SELECT id, name, year_hired
FROM employees;

CREATE VIEW, then the name will create the new view.

Using views

we can query it just as we would a normal table by selecting FROM the view.

SELECT id, name
FROM employee_hire_years;

**SELECT** id, name **FROM** employee_hire_years;

-- create the view:
CREATE VIEW library_authors AS
SELECT DISTINCT author AS unique_author
FROM books;

-- Select all columns from library_authors
SELECT * FROM library_authors

Viewing your query

You have worked hard to create the below SQL query:

SELECT DISTINCT author AS unique_author
FROM books;

SQL flavors

Both free and paid
All used with relational database
Vast majority of keywords are the same
All must follow universal standards

Two popular SQL flavors

PostgreSQL	SQL Server
Free and open-source relational database system.	Has free and paid version
Created at the university of California, Berkeley	Create by Microsoft
“PostgreSQL” refers to both the PostgreSQL database system and its associted SQL flavor	T-SQL is Microsoft SQL flavor, used with SQL Server databases

Comparing PostgreSQL and SQL Server

--PostgreSQL:
SELECT id, name
FROM employees
LIMIT 2;
--SQL Server:
SELECT TOP(2) id, name
FROM employees;

SQL Server using the TOP keyword instead of LIMIT. Notice that this keyword is the only difference between the two queries!

Summary

“I hope the foundational knowledge for advancing projects in the following five areas will be helpful for everyone:

Database
Tables
SQL Data Types
Introducing Queries
Writing Queries”

link from : https://www.datacamp.com/completed/statement-of-accomplishment/course/105fbc7d0c0243f35770512f02a1ad437c184bd7

Introduction to SQL course

December 18, 2024

5 Steps of Data Analysis By Using Microsoft Excel

Data analyst is an essential tool that enables organizations to gain deeper insights into their data. Utilizing Microsoft Excel for efficient data processing facilitates accurate and prompt decision-making while identifying trends and uncovering new business opportunities.

Table of content

The 5 steps for analyzing the sales_data_analysis.xlsx file in Microsoft Excel 365 are as follows

Excel can help data analysis

Excel work with a prepared spreadsheet that contains sale

The 5 steps for analyzing the sales_data_analysis.xlsx file in Microsoft Excel 365 are as follows

Upload a document using the free online version of Microsoft Office 365

Click add new → File upload → then upload → sales_data_analysis_23.10.2024

Go to Insert → Table to create a table that uses the header in the first column to filter data. → Click OK.

To Create a table to filter data, see the picture below.

Then filter the data shown in the picture below.

can **filter data** by columns such as Num, Date, Month, Sales Rep, Region, Customer ID, etc.

Set it up so that when you scroll down to view data in the rows below, the first column remains visible. This makes it much easier to reference the headings.

Perform data analysis using sorting and filter tools.

Which column should be prioritized for sorting data to make it more effective?

representative header and then select sort A to Z to sort it in alphabetical order

Sort the **Sales Rep column** from A to Z.

after click it has been rearrange by alphabetical Sales Rep
then to make it back to select sort in column date again
can sort the Region by North.

To remove the filter, click the ‘Select All‘ checkbox to display sales from the North, South, and West regions, and then click.
Then filter the Sales Rep column by the name David Garcia.

What you can see from data?

Average of $7,893

Count of 9

Sum of $71,040

you can see aggregate value in the bottom right corner.

This is how to use the sorting and filtering tools to rearrange your data.

Perform data mining using the IF Function

The idea behind data mining is to take the data you already have and create new or additional data from it.
The IF function is frequently used.

Samples show that when an order includes 20 chairs or more, the client receives a 5% wholesale discount.

Discount Column 3 Method

1. Create a discount column to the right to reflect this.

2. In the column, use ‘Y‘ for orders with a quantity ≥ 20.

3. In the column, use ‘N‘ for orders with a quantity ≤ 20.

Code for column Discount

=IF(J5>=20,"Y","N")

**Create** a discount column based on the quantity in the number column.

Final Price column

Code for column Final Price

=IF(J5>20,0.95*L5,L5)

**Create** a Final Price column based on discount and number column.

column of Discount with Y is number ≥ 20 price is discount 5% final is 95% from total column
column of Discount with N is number ≤ 20 price is same as the total

Create references between tables and search for information with VLOOKUP

Goal is to insert the company name using the client ID.

Create column Company Name between Customer ID and Color
Create column Company Name Representative between Company name and model

The explanation of the variables used in VLOOKUP.

VLOOKUP(lookup_value, table_aray, col_index_num, [range_lookup])

Create Company name column

=VLOOKUP(F5,'Customer Info'!$A$4:$C$12,2,FALSE)

Create Company name representative

=VLOOKUP(F5,'Customer Info'!$A$4:$C$12,3,FALSE)

to connect data between sales data and customer info sheet

Create column **Company name** and **Company Name Representative** column in sale data sheet

can adding a dollar sign in front A4 and C12 to make data not change

Perform data analysis using Pivot Table

Method to start

Typing control A and it will automatically highlight all the data in the table
Then insert ———> Pivot table ———> click okay

Recommend selecting new worksheet so it placement does not affect the other data that already exists

Pivot table

define categories as either filters, columns, rows, value

Type	Definition
Filter	2 Category
Column	Category can specified as a column to fit with data
Row	Category can specified as a row to fit with data
Values	Must use with number

1. drag Final Price to Value of pivot table
2. drag Sales Rep to columns of pivot table
3. drag Month to row of pivot table

how many chairs of each model were sold in each month?

Pivot table many chairs of each model per month.

Finally you have learned how to create pivot tables to summarize and look at comparisons within your data.

Access full Microsoft excel through below link:

sales_data_analysis_23.10.2024_chayanon_boonthanom Download

Coursera Project Network Certificate

https://www.coursera.org/account/accomplishments/verify/2F13S7V2LEZF

Summary

Data analysts help organizations gain valuable insights from data. Using Microsoft Excel enhances decision-making by processing data efficiently and identifying trends and opportunities.

December 17, 2024

10 Social Listening of Data Analyst สามารถหา trend ของข้อมูลได้

ทำไมเราถึงควรใช้ Social Listening Tool ด้วย

สามารถเข้าใจข้อความที่ลูกค้าพูดถึงแบรนด์ของเราได้ดีขึ้น
เข้าใจสิ่งที่ลูกค้าชอบและไม่ชอบแบรนด์คืออะไรบ้าง พร้อมข้อเสนอแนะเพิ่มเติม
Crisis management ปัญหาที่จะเป็นอันตรายต่อองค์กร ต้องรู้จักวิธีรับมือทาง Social media

มีการแชร์ *Hashtag* ของเราไปกี่ครั้ง *Engagement* เป็นอย่างไร

คอร์ส Social Listening ฟรีจาก DataRockie : https://data-science-bootcamp1.teachable.com/courses/enrolled/2121068

Social Listening

Social Listening Tools เป็นเครื่องมือที่ธุรกิจใช้เพื่อติดตามและวิเคราะห์ความคิดเห็นของผู้บริโภคบนโซเชียลมีเดียและช่องทางออนไลน์ต่าง ๆ ข้อมูลที่ได้รับจากการรับฟังเหล่านี้สามารถให้ข้อมูลเชิงลึกสำคัญเพื่อใช้ในการวิจัยตลาด ช่วยให้ธุรกิจปรับปรุงผลิตภัณฑ์และแคมเปญการตลาดได้อย่างมีประสิทธิภาพ

ควรใช้ keywords ในการดึง (extract) ข้อมูลจาก social media platforms ต่างๆ เช่น ถ้ามีคนโพสต์หรือคอมเมนต์เกี่ยวกับ “Samsung” ระบบจะสามารถดึงทุก contents ที่มีคำว่า “Samsung” มาให้หมดเลย

เป็นการสร้าง keyword ของแบรนด์เหล่านั้นและรุ่นเหล่านั้นขึ้นมา เพื่อให้สามารถไปค้นหาใน hashtag ได้ platform ออนไลน์ต่างๆ

Hashtag

# Samsung

# SamsungGalaxy

# withGalaxy

แล้วจะส่งเหล่านี้ผ่าน facebook , twitter แล้ว extract data ข้อมูลออกมา

Content in Social Media จัดเป็น unstructure data เพื่อจัดข้อมูลที่ไม่มีโครงสร้าง

Type	Definition
Content type	ประเภทของเนื้อหา
Account	บัญชีผู้ใช้
Date & Time	วันที่และเวลา
Content	เนื้อหา
Engagement	การมีส่วนร่วม

รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

วิธีการของ Social Listening จะเปลี่ยนจาก unstructure data เป็น structure data

สามารถดึง post กับ comment ได้ดังรูป
สามารถนำ comment ไปปรับปรุง content ได้

Post ตามรูปแนบ รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

How Starts with Keywords

ต้องมีการกำหนดคำขึ้นมาก่อนว่าควรใช้คำไหนเริ่มต้นก่อนดีกว่ากัน

ต้องไปศึกษาแบรนด์นั้นๆเลยว่า “คำ” ไหนที่แบรนด์เหล่านั้นใช้บ่อย

Popular hashtags

BRAND	MODEL
Samsung	GalaxyS23FE
ซัมซุง	Galaxy+S23+FE
galaxyUnboxing
galaxyTH
SamsungTH
SamsungGalaxyTH

เทคนิคที่หลายๆ platform รองรับคือการใช้อักขระพิเศษอย่างเช่น + (สำหรับ AND) หรือ – (สำหรับเขียน negative keywords)

Galaxy+S23+FE hashtag ที่มีการ mention “Galaxy” และ “S23” และ “FE” ในโพสต์เดียวกัน แต่ไม่จำเป็นต้องเขียนติดกันก็ได้

Keyword Strategy

โดยทั่วไปเครื่องมือ social listening จะให้เราใส่ keyword ได้หลายรูปแบบ โดยแบ่ง keyword ออกเป็น 3 ประเภท (ตามเครื่องมือของ Zocial Eye ของบริษัท Wisesight)

Type	Definition
Main Keyword	ใช้ดึงข้อมูลหรือ message เข้ามาที่ระบบ
Sub Keyword	ใช้จับกลุ่ม message เช่น การทำ tag/ category
Negative Keyword	ใช้ลบ message ที่ไม่เกี่ยวข้องออกจากระบบ

รูปภาพประกอบ **Main Keyword**, **Sub Keyword**, **Negative Keyword**
รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

ZOCIAL EYE คือ เครื่องมือวิเคราะห์ข้อมูลโซเชียลมีเดีย ที่สามารถดึงข้อมูลจากแหล่งโซเชียลมีเดียต่างๆ ไม่ว่าจะเป็น Facebook, Twitter

An Overview of Our Data

**Overview of Our Data** แสดงรายละเอียดของตัวแปรที่ผลต่อข้อมูลและความหมายของตัวแปรเหล่านั้น

เครื่องมือ Social listening จริงๆมีให้เลือกเยอะมากดังนี้

Brand Watch
Sprout Social
Keyhole
Meltwater
Brand24

ราคา package จะขึ้นอยู่กับสองปัจจัยหลักๆคือ

จำนวน keywords เช่น 10 คำ, 20 คำ เป็นต้น
จำนวน messages ที่เราสามารถดูดเข้าในระบบได้
ยิ่ง keywords และ messages เยอะขึ้น ราคาจะสูงขึ้นด้วย

เครื่องมือ social listening ที่บทความนี้จะนำมี 4 รายการดังนี้

Google Trend
Zocial Eyes
Supermetrics
Google Sheets & Looker Studio

Google Trends

Google Trends คือ เว็บไซต์ Social Listening ฟรีจาก Google สำหรับค้นหาและตรวจสอบความนิยมของ keyword จากคนใช้งานบน Google และ Youtube ณ ช่วงเวลาใดเวลาหนึ่งจากทั่วโลก ที่ทำให้เรารู้ว่าตอนนี้กลุ่มเป้าหมายของเรากำลังสนใจเรื่องอะไร และใช้คีย์เวิร์ดแบบไหน ได้อย่าง Real-Time และ Worldwide

ให้เห็นภาพกันมากขึ้น ลองเอากระแสสุดปังอย่าง ‘หมีเนย’ กับ ‘หมูเด้ง’ มาค้นหาเทรนด์ความนิยม จะเห็นจากกราฟว่าในช่วงวันที่ 09 ก.ย. 68 มีคนให้ความสนใจและเสิร์ชคีย์เวิร์ดคำว่า ‘หมูเด้ง’ มากกว่า ‘หมีเนย’ เกือบเท่าตัว

Zocial Eye is The Only Tool You Need

ทำ social listening เกี่ยวกับข้อมูลที่ลูกค้าสนใจแล้วนำไปทำ visualization เป็น graph ให้ลูกค้าใช้งาน
นำข้อมูลที่เป็น trend ไปนำเสนอลูกค้าด้วย
Zocial Eye ทำได้ตั้งแต่ใส่ keywords เพื่อ extract ข้อมูล ทำความสะอาด ติด tag/ category ทำ sentiment ไปจนถึงการ visualize ข้อมูลให้เข้าใจง่าย

Zocial Eye เป็นเครื่องมือ social listening / analytics เบอร์หนึ่งของประเทศไทย เปิดให้บริการมาตั้งแต่ปี 2007 รายได้มากกว่า 230 ล้านบาท (2565) ปัจจุบันมีพนักงานมากกว่า 250 คน

Supermetrics

สามารถดึงข้อมูลจาก Social Media Platforms ด้วยตัวเอง เช่น Facebook, Instagram, Twitter โดยใช้ Supermetrics + Google Sheets

Google Sheets & Looker Studio

โดย 2 เครื่องมือที่นิยมใช้ในงาน Social Listening คือ Excel/ Google Sheets และ BI Tools เช่น Looker Studio หรือ Power BI เพื่อทำรายงาน dashboard ติดตามผลง่ายๆ

ใช้สำหรับ Social Listening ใช้ Google Sheets + Looker Studio เป็นหลัก

Case Study : Oppo Find N2 Flip Launch

ทีมการตลาดของ Oppo Global ได้หา insights ว่าลูกค้าที่ใช้โทรศัพท์จอพับมีปัญหาอะไรบ้าง (จากการทำ research และ social listening) พบว่าสามปัญหาที่ลูกค้าบ่นเยอะที่สุดคือ

แบตเตอรี่ของ Foldable Smartphone ใช้งานได้ไม่เต็มวัน ต้อง Charge บ่อยๆ
คนยังกังวลเรื่องหน้าจอ กลัวว่าจะพังง่าย พับบ่อยเป็นรอย
หน้าจอด้านนอกเล็ก ใช้งานไม่สะดวก (อ้างอิงจากคู่แข่งโดยตรงอย่าง Galaxy Z Flip)

Problem of user experience
รูปภาพจาก Social Listening — **Data Analyst Edition** by DataRockie

Oppo Global นำ insights ที่ได้มาใช้พัฒนา Oppo Find N2 Flip แก้ปัญหา pain points ที่ลูกค้าเจอ เปิดตัว official launch ที่ประเทศอังกฤษในเดือน ก.พ. 2023 โดยเน้นที่สามปัญหาข้างต้น

หลังจากเปิดตัว Oppo Find N2 Flip ได้รับเสียงตอบรับจากกลุ่ม Mass Segment ดีมาก ราคาเปิดตัวต่ำกว่าของ Samsung Galaxy Z Flip4 ประมาณ 10,000 บาท

ด้านล่างเป็น Feedback จากลูกค้าในวีดีโอไลฟ์เปิดตัว และโพสต์โปรโมชันบน Facebook Oppo Thailand

Comment in Social Media
รูปภาพจาก Social Listening — **Data Analyst** Edition by DataRockie

Tip – เวลาเรานำ comments จริงๆของลูกค้ามาใส่ใน report เรียกว่า verbatim หรืออีกชื่อคือ word of word งานวิจัยสาย qualitative จะใช้แบบนี้กันเยอะเลย

Social Listening Method	Definition
Extract	ดึงข้อมูลตาม keyword ที่เรากำหนดจาก social media platforms ต่างๆ
Analyze	ทำความสะอาดและวิเคราะห์ข้อมูล
Present	นำเสนอ insights ที่มีประโยชน์ต่อแบรนด์/ เรื่องที่เราสนใจ

Extract ใช้ Main keyword ในการดึงข้อมูลขึ้นมา
ระหว่าง Extract กับ Analyze ใช้ Negative key word ในการดึงข้อมูลที่ไม่สำคัญทิ้งไป
Sub keyword ใช้ในการวิเคราะห์ เช่น กล้อง ราคา promotion

Zocial Eye Dashboard
รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

Extract Data for Zocial Eye

Extract Data From Platform

การดึงข้อมูลจาก Platform ต่างๆ

Facebook Data
Instagram Data
Application Program Interface

Facebook Data

จะเริ่มดึงข้อมูลจาก Social Media Platform โดยใช้ Supermetrics ในการดึงข้อมูลจาก facebook public page
สามารถใช้ application นี้ฟรี 14 วัน

ตัวอย่างนี้เป็นการลองดึงข้อมูลจาก Facebook Public Post แบบง่ายๆด้วย Supermetrics แค่เลือก data source -> page -> period -> dimensions/ metrics ที่ต้องการ

Instagram Data

Extract ข้อมูลผ่าน IG Data

โดยการดึงข้อมูลจาก Instagram ผ่าน hashtag

Application Program Interface

รูปภาพ **API Process** จาก Social Listening — Data Analyst Edition by DataRockie

API ย่อมาจาก “Application Program Interface” (ส่วนต่อประสานโปรแกรมประยุกต์) ในบริบทของ API คำว่า “Application” หมายถึงทุกซอฟต์แวร์ที่มีฟังก์ชันชัดเจน ส่วน “Interface” อาจถือเป็นสัญญาบริการระหว่างสองแอปพลิเคชัน ซึ่งสัญญานี้จะกำหนดวิธีที่ทั้งสองสื่อสารกันโดยใช้คำขอและการตอบกลับ

Zocial eye สามารถดึงพวก content ใน twitter ให้เราได้เลย

Set Up Campaign

ข้อดีคือระบบจะดึงข้อมูลที่เราต้องการให้เราได้เลย โดยไม่ต้องสนใจความซับซ้อนของ API เนื่องจากหลังของพวก twitter มีการเปลี่ยนแปลงบ่อย
ใส่ “Message” ที่เราอยากดึงข้อมูลออกมาตามนี้ได้เลย

รูปภาพ keyword and sub keyword จาก Social Listening — Data Analyst Edition by DataRockie

Check Period

หาช่วงเวลาให้ตรงกับที่เราสนใจก่อน แล้วจะได้เปรียบเทียบข้อมูลทั้ง 3 platform facebook, new and other

สามารถดูหน้าโพสต์นั้นผ่าน All channels skoodio Page ได้เลย

สามารถเปลี่ยน sub keyword หรือ message ที่จะหาได้ตลอดเลย
สามารถดู sentiment ได้ตรง top message engagement ได้เลย

สามารถเก็บได้แค่ public account ได้

Review Messages

เราสามารถเข้าไปดู Messages หรือข้อความทั้งหมดที่ระบบ Zocial Eye ดึงมาให้เราได้เลย

Export Data

เลือก period ที่เราต้องการ เช่น last 30 days คลิกที่ปุ่ม Export เพื่อส่งข้อมูลไปที่อีเมลของเรา ข้อมูลจะมาในไฟล์ Excel .xlsx format

กดปุ่ม export แล้วดึง all channel ออกมาให้หมดเลย แล้ว กด summit แล้วจะได้ข้อมูล

Analyst Data for Zocial Eye

ก่อนเราจะวิเคราะห์ข้อมูลได้ ต้อง make sure ว่าข้อมูลเราสะอาดก่อน ideal = 100%

Process รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

Delete Message ที่สัญลักษณ์ถังขยะ
Exclude account ลบ message ที่มาจาก Account นั้นให้กด Exclude
Adjust sentiment เปลี่ยนได้โดยเลือก Postive, Negative, Neutral
Adsign category เลือก category ที่สนใจ

Tag & Category

Tag & Category คือการสร้าง Dimension ไปแปะข้อความที่เราเก็บข้อมูลมา

เราสามารถติด tag หรือ category ที่แต่ละข้อความเพื่อจัดหมวดหมู่ สร้าง dimensionไว้ใช้วิเคราะห์ข้อมูลต่อไป ในหนึ่งโปรเจ็คอาจมีมากกว่า 1 category(s) ในแต่ละ category มีได้มากกว่า 1 tag(s)

ตัวอย่างการแบ่ง **Category** เป็นภาพใหญ่ **Tag** เป็นภาพย่อยอีกที

Type	Definition
Category	ภาพใหญ่ของการจัดกลุ่ม เช่น ประเภทสินค้า (CPU, IP68)
Tag	ภาพย่อยที่ให้รายละเอียดเฉพาะเจาะจง เช่น กันน้ำ, กันน้ำกันฝุ่น ภายใต้ประเภทเทคโนโลยี

ตัวอย่าง **Project, Category, Sentiment, Tag**

Assign Tags in Google Sheets

Topic, Brand, Promotion, Camera, Interest

Sentiment Analysis

sentiment analysis with AI

มาลองใช้ gemini.google.com ช่วยทำ sentiment analysis ง่ายๆ ได้ทั้งภาษาไทยและอังกฤษ

ตัวอย่าง prompt engineer 101

Do sentiment analysis for this “message” using options [“neutral”, “positive”, “negative”].

Aggregate Data

ก่อนที่เราจะเข้าสู่ Phase 3 [Present] ของโปรเจ็ค Social Listening เราจะทำการ Aggregate Data ให้อยู่ในรูปแบบ Table ที่เข้าใจง่ายๆก่อน
เครื่องมือที่เราใช้ทำ Aggregate Data ที่ง่ายที่สุดคือ Pivot Table ใน Excel/ Google Sheets หรือจะใช้ BI Tools ที่กำลังเป็นที่นิยมตอนนี้ เช่น Looker และ Power BI ก็ได้

Aggregate Data แปลไทยง่ายๆว่าการสรุปผลสถิติ

Type	Description
Dimension	ข้อมูลที่เป็น category เช่น ในตัวอย่างนี้เป็น sentiment
Measurement	ข้อมูลที่เป็น numeric เช่น ในตัวอย่างนี้เป็น Total Message, Total Engagement

สามารถดู Measurement ของ Like, Share, Comment and Total Engagement
รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

และใช้ Dimension กับ measurement ใน pivot บน excel หรือ google sheet จะง่ายที่สุด

สามารถดู Dimension ของ Channel, Sentiment, Category
รูปภาพจาก Social Listening — Data Analyst Edition by DataRockie

Present Data Visualization Basics

การนำเสนอข้อมูล Social Listening ใช้ความรู้พื้นฐาน Data Visualization ง่ายๆ

Charts ที่เรานิยมใช้กัน เช่น Line, Pie, Bar, Simple Table และ Word Cloud

Line Chart นิยมใช้แสดงผล Time Series เป็น chart หลักที่เราใช้ในการรายงานผล สอง metrics ที่เราดูเทรนด์ในทุกโปรเจ็คคือ message และ engagement

2. Simple Table ใช้แสดงข้อมูลง่ายๆ Users สามารถ copy data นำไปใช้ต่อได้สะดวก เช่น export ไปใส่ spreadsheets เพื่อสร้าง visualization ที่เค้าต้องการ

3. Bar Chart เป็นอีกหนึ่ง common chart ที่เราใช้ในทุกโปรเจ็ค เปลี่ยนข้อมูลใน table ให้กลายเป็น simple visualization แสดงผล categories ต่างๆ โดยทั่วไปจะเรียง category จากมากไปน้อย

4. Pie Chart ใช้แสดงข้อมูลแบบ categories ที่รวมกันได้ 100% (The Whole) ตัวอย่างนี้เราแบ่งกลุ่ม mentions ตามประเภทของ influencers เป็น pico, micro, nano และ mid-tier ตามจำนวน followers

**Pie Chart** for influencers categories

5. Word Cloud ใช้แสดงผลข้อความ phrase หรือ #hashtag ที่มีการ mentions เยอะๆในข้อมูล font size ยิ่งใหญ่ แปลว่ามีการพูดถึงคำนั้นเยอะ

**Word Cloud** สำหรับคำที่มีการพูดถึงเยอะ

Zocial Eye Dashboard มี common charts ทั้งหมดนี้ให้เราใช้งานได้เลย แต่ถ้าอยากจะ export data ออกมาทำเองแบบ manual ก็ได้เช่นกัน (หรือเอา raw data ไปขึ้น BI tools)

Report & Dashboard

Looker Studio – https://lookerstudio.google.com/

เราสามารถโหลดข้อมูลที่ได้จาก Social Listening Tools เข้า Business Intelligence (BI) Software เช่น Looker Studio, Power BI หรือ Tableau เพื่อทำ Report และ Dashboard เสนอผลวิเคราะห์

สำหรับ Program Google Looker Studio มีขั้นตอนดังนี้

Export file csv to looker studio
+ Blank Report page in looker studio
click to upload file
add to report
แล้วสามารถใส่ chart ตามใจชอบ

Slide Making

ทีม Data Analyst ใช้ Slide Presentation ในการนำเสนอ Insights ที่น่าสนใจ

Software ที่เราใช้ทำ Slide Presentation เช่น

PowerPoint (Windows)
Keynote (Mac OS)
Google Slides (Web-Based)
Canva (Web-Based)

Tip – หลายคนอาจจะไม่รู้ แต่ Slide Making คืออีกหนึ่งทักษะสำคัญของการเป็น Data Analyst ที่ดีเลย หรือที่เราเรียกกันว่า Communication Skill (ใน Job Description/ Requirement)

ข้อจำกัดของข้อมูล Social Listening ที่เราเก็บเข้ามาในระบบ

เราไม่สามารถแยกระหว่าง Organic vs. Paid ได้ นอกจากเราจะเป็นเจ้าของ Page/ Channel นั้นๆถึงจะรู้ว่าโพสต์ไหนบ้างใช้เงินอัดโฆษณา

เรื่องการ Update ระบบ Social Listening จะมีรอบในการวิ่งกลับไปเก็บข้อมูลล่าสุด เช่น Post วันที่ 1 ก.ย. 2023 จะมีการวิ่งกลับไปเก็บอีกสองครั้งวันที่ 8 และ 15 ก.ย. 2023 หลังจากนั้น ระบบจะหยุดเก็บข้อมูลแล้ว ทำให้ตัวเลขบน Social Media กับใน Report อาจจะไม่ตรงกันเป๊ะ 100%

Social media platforms บางตัวยังไม่เปิดให้ใช้งาน Public API เช่น TikTok (อัพเดท ต.ค. 2023) หรือมี API ให้ใช้ แต่ไม่ส่งค่าบางอย่างกลับมาเช่น Facebook API ยังไม่แชร์ค่า Video Views บน Page กลับมาให้เราวิเคราะห์ต่อ ต้องไปดึงแบบ manual เอง
facebook ยังเก็บบาง metric ไม่ได้เช่น ยอดวิวของวิดีโอ
⚠️⚠️⚠️Social Listening ตอนนี้จะเก็บข้อมูลจาก Public Profile เท่านั้น ถ้าเป็น Account ส่วนตัวหรือ Private Post ที่ปิดกั้นการมองเห็น รวมถึง Ads ต่างๆ ด้วยเทคโนโลยีตอนนี้ ระบบจะยังเก็บข้อมูลมาไม่ได้นะครับ

Summary

รุปสำหรับ 3 ขั้นตอนในการทำโปรเจ็ค Social Listening

Extract
Analyze (+Clean)
Present

หวังว่าบทความเรื่อง Social Listening นี้จะเป็นประโยชน์กับทุกท่าน

ขอบคุณคอร์ส Social Listening ฟรีจาก DataRockie : https://data-science-bootcamp1.teachable.com/courses/enrolled/2121068

December 17, 2024

Visualizations Pane Type	Definition
Report View	The Dashboard Report
Table View	Dataset Details
Model View	Data Relationship Model
DAX Query View	DAX Query Editor

Axis	Column of Data File
Y-axis	Sum Cookies shipped
X-axis	Order Date

Axis	Column Data Files
X-Axis	Order date by day
Y-Axis	Quantity

Axis	Column Final Files
X-Axis	Order Date
Y-Axis	Quantity

Axis	Column Final Files
Y-Axis	Count of Reorder level
X-Axis	Supplier